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PROCESSES OF FEAR REDUCTION IN SYSTEMATIC 
DESENSITIZATION: 
ANIMAL STUDIES* 
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State University of Ne 


and several tentative 
represents an unwarranted neurologiz: 


unsusceptible to extinction 
experimental extinction; 


stimulus (CS), while not 
exposure to the full (traini 
ance, may be inferior to gra 
sponses; (e) pro 
facilitate extinction ra 
of anxiety-competing responses, 
procedure, may be important i 
aversive stimulus. 


Recent years have witnessed an increasingly 
Critical examination of theory and research 
in behavior modification — (e.g. Franks, 
1969), in marked contrast with the allegedly 

defensive" polemics of earlier years (e.g.. 
Eysenck, 1960; Eysenck & Rachman, 1965). 
Interest in behavior modification has been 
high among not only clinicians but also 
among — experimental psychologists. The 
(Over-) eagerness with which clinicians have 
embraced these “new” techniques is perhaps 
understandable in view of the inadequacy of 
those approaches and techniques that have 
held Sway in most training and professional 
settings. Many experimentalists have been at- 
tracted to the field because, in our view, 
behavioral techniques seem to offer the oppor- 
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conclusions are suggested: 
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procedures is inconsistent with the firm findings of 
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tically reviewed with respect 
of systematic desensitization, 
(a) “Reciprocal inhibition” 


“counterconditioning” is in sore 
conditioned 


superior in effectiveness to repeated 


he CS seems, on the whole, to 
ubject; and (f) the use 
e standard desensitization 
forced exposure to the 


tunity to examine issues and problems of 
general psychological importance (viz., Lang, 
1969). 

Probably the technique that has attracted 
the most attention in both circles (though 
we hardly intend to suggest that the circles 
do not intersect—the present authors regard 
themselves as both clinicians and experimen- 
talists) is Wolpe’s systematic desensitization 
(Wolpe, 1952, 1958, 1969). As Paul (1969) 


has recently concluded: 
rst time in the history of psycho- 


logical treatments [that] 2 specific therapeutic pack- 
age reliably produced measurable benefits for clients 
across à broad range of distressing problems in 
which anxiety of fundamental importance 


[p. 159]. 
Moreover, 


[It represents] the fi 


was 


considerable research effort has 
been fruitfully directed to parceling out the 
active ingredients of this procedure; the fact 
that we are still far from reliable answers is 
less important than the fact that much con- 
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trolled research is underway within an 
experimental psychological framework. 

Several reviews have appeared over the past 
few years examining the clinical, experi- 
mental, and theoretical foundations of desen- 
sitization (e.g., Bandura, 1969; Lang, 1969; 
Paul, 1969; Rachman, 1967). Aside from the 
inevitable fact that numerous studies have 
been reported since their writing, we find that 
none of the available reviews comes to grips 
with some of the very basic psychological 
issues raised by research in desensitization 
(although Lang’s and Bandura’s certainly are 
in this spirit). Especially lacking in these re- 
views is a careful consideration of experi- 
mental work with infrahuman organisms deal- 
ing with the more general problem of reducing 
fear and/or avoidance behavior. It seems to 
us peculiarly appropriate to examine these 
experimental bases of desensitization inas- 
much as Wolpe claims to have derived the 
technique from his and others’ work with 
animals. Moreover, many influential writers 
(e.g., Eysenck, 1959) argue that behavior 
therapy as a field is based on “learning 
theory,” which historically has been heavily 
involved in the study of animal behavior. 
Though mindful of the inherent limitations of 
analogue research, we would be remiss and 
mistakenly “humanistic” to ignore the pos- 
sible implications of research conducted, in 
the main, by workers who have never seen 
an anxiety hierarchy. 

This article, then, examines experimental 
research in fear reduction with animals, which 
seems to relate fairly directly to the Opera- 
tions that characterize systematic desensitiza- 
tion, as well as to the theoretical under- 
pinnings of the procedure. A separate paper 


will be devoted to a discussion of experimental 
research with humans. 


THE PRINCIPLE oF RECIPROCAL INHIBITION 


According to Wolpe, the concept of “recip- 
rocal inhibition” is central to the technique 
of systematic desensitization. Originally pos- 
tulated by him as a neurological mechanism 
to explain his early experiments with cats, 
it has acquired considerable surplus meaning 
developing into one of behavior therapy’s 
most frequently touted slogans. Thus, it has 


been referred to as a “method” (Rachman, 
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1967), a “learning process” (Rachman, 
1965), an “explanation” (Lomont, 1965; 
Rachman, 1967), a “complex neural phe- 
nomenon” (Wolpe, 1962), and a "therapeutic 
technique” (Eysenck, 1960, 1964). Obviously 
such obfuscation contributes little to an 
understanding either of the concept or of the 
technique(s) that it is invoked to explain. 
In our opinion, it has become a panchreston, 
with all the unfortunate consequences at- 
tendant upon a confusion of process with 
procedure (cf. Evans & Wilson, 1968). 

Systematic desensitization, although not 4 
novel technique, was evolved and promulgated 
in its present form by Wolpe largely on the 
basis of his studies on the experimental 
induction and elimination of neurotic fears 
in cats. He found that analogous to the 
original clinical work of Jones (1924), condi- 
tioned emotional responses could be elimi- 
nated by feeding animals gradually closer to i 
the locus of the fear conditioning. The main 
notion was that there is always a point along 
the generalization gradient at which the anx- 
iety elicited is not so severe as to prevent the 
elicitation of a physiologically incompatible 
response, like eating (presumed to be in- 
compatible) —a response that inhibits the 
simultaneous occurrence of the anxiety. The. 
repeated reciprocal inhibition of the fear 
responses is said to result in their permanent 
conditioned inhibition. The conceptual frame- 
work is Hull’s (1943), with “reciprocal inhibi- 
tion" regarded as functionally equivalent to 
"reactive inhibition" (see below). 

In a massive extrapolation, Wolpe sub- 
stituted imaginal representation of the 4 
feared conditioned stimulus (CS) for actual 
exposure, and substituted Jacobson’s (1938) 
training in muscle relaxation for feeding. 

Wolpe adapted from Sherrington (1906) 
the term “reciprocal inhibition.” Sherrington $ 
usage referred to the suppression of one "è i 
by the evocation of another. Thus, reflex » 
relaxation of the extensor muscles of the arms 
accompanies contraction of the flexor muscles, 
there being a reciprocal relaxation of the 
flexors when the extensors contract. As We 
have pointed out elsewhere (Davison, 1968). 
Wolpe's postulation of reciprocal inhibition 4°. 
the very complex levels of neural and behav- 
ioral organization that he is concerned with 
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represents an extreme and probably un- 
justified extension of the term as originally 
employed by Sherrington. 

l Wolpe explicitly rejected experimental ex- 
tinction (i.e., “The specific procedure of pre- 
senting the conditioned stimulus unaccompa- 
nied by the usual reinforcement; also the 
decrement in a conditioned response which 
results from that procedure [Kimble, 1961, 
D. 479]”) as a viable procedural description 
of systematic desensitization. He argued that 
exposure alone is ineffective as far as anxiety 
Tesponses are concerned (Wolpe, 1958, p. 71); 
citing Miller (1951) as showing that a fear- 
motivated motor habit required hundreds of 
trials to extinguish. Aside from the fact 
that Miller's data demonstrated the possible 
"efficiency of repeated exposure—not inej- 
Jectiveness—a more detailed consideration of 
recent data demonstrates that conditioned 
avoidance responses can, indeed, be rapidly 
€xtinguished via exposure to the CS under 
appropriate conditions (cf. Baum, 1970; 
Katzev, 1967; Shearman, 1968; Solomon, 
Kamin, & Wynne, 1953, Experiment 5; Weiss 
& Monahan, 1969). 

Moreover, beyond these empirical consid- 
erations, Wolpe's theoretical reasons for dis- 


~ Missing sheer exposure seem to be erroneous. 


The slow extinction of anxiety responses was 
attributed by him to the "fact" that auto- 
Nomic responses generate only à small amount 
Of reactive inhibition. Apart from the prob- 
lems inherent in conceptualizing anxiety as 
solely autonomic (cf. Schachter & Singer, 
1962), the basic logic of the Hullian fatigue 
theory of extinction—which Wolpe explicitly 
ACCepts. has been severely embarrassed (e.g. 
Adams, 1963; Deutsch, 1960; Gleitman, 
Nachmias, & Neisser, 1954; Jensen. 1961). 
Indeed, to return to data, Lomont (1965) 
has pointed out that in Wolpe’s original 
experiments there was no control group that 
Was gradually exposed to the locus of shock 
Without being fed. In fact, Wolpe “cured” 
three cats by a “forced solution” method, 
entailing prevention of the avoidance response 
In order to forcibly expose the animals to 


mathe Cs. (This datum conflicts sharply with 


em : j 
d pus dicta against exposing clients to 
Versive imaginal and real-life stimuli once 


anxiety has been elicited; cf. our discussion 
below of the “resensitization hypothesis.” ) 
Thus, Wolpe’s rejection of the possible 
effectiveness of reexposure without explicitly 
provided antagonistic responses might be 
accounted for by the following chain of three 
erroneous assumptions leading to an invalid 
conclusion: (a) Extinction takes place exclu- 
sively via buildup of a negative habit, s/r, 
which (b) requires generation of an aversive 
drive, Zr, from the response to be eliminated; 
(c) anxiety, as primarily or solely an auto- 
nomic (sympathetic) response, generates little 
Ir, so that (d) therefore a specific incompati- 
ble response must be elicited to work against 
the simultaneous evocation of the fear re- 
sponse, providing reciprocal inhibition, a 
functional equivalent of reactive inhibition, as 
a basis for the acquisition of the negative 
habit, s/r. The very questionable validity of 
Steps 1-3 renders the conclusion (d) invalid. 
It appears, then, that Wolpe’s celebrated 
formulation of the reciprocal inhibition prin- 
ciple constituted an unwarranted and prob- 
ably also unparsimonious transformation of a 
narrow physiological principle to explain very 
complex and occasionally insufficiently con- 
trolled behavioral data. This certainly does 
not detract from the considerable practical 
significance of the work attendant on Wolpe’s 
speculations, but it may serve to allow us to 
break set in order to examine carefully the 
experimental literature that is relevant to an 
understanding of fear reduction as it relates 


to desensitization. 
LITERATURE REVIEW 


Systematic desensitization involves at least 
the following crucial elements: (a) Exposure 
to aversive stimuli, (5) gradations in this 


exposure from least to most anxiety pro- 


voking, and (c) pairing of anxiety-competing 
responses. with graded exposure to aversive 


stimuli. 
Ungraded Exposure to Aversive Stimuli Paired 
and Unpaired with Anxiety-Com peting 
Responses 

Lomont (1965) has reviewed animal experi- 
ments in which comparisons were made in 


efficacy of fear reduction via repeated expo- 
sure to the CS at training intensity versus 


ks 


pairing of such exposures with anxiety- 
competing responses. Farber (1948) found 
that feeding in the presence of the CS elimi- 
nated conditioned fear, but lacking in the 
study was an appropriate control group in 
which animals received exactly the same 
exposure opportunities without the feeding. 
Moltz (1954) replicated Farber and added 
this necessary control, finding that feeding in 
the presence of the CS did not enhance fear 
reduction beyond merely confining rats in 
the fear-provoking situation. Sermat and 
Shephard (1959) found that feeding pro- 
duced only a weak and transient reduction 
in a conditioned avoidance response (CAR) 
when compared with a group merely confined 
in the nonreinforced presence of the CS. Lane 
(1954) and Hall (1955) obtained results 
indicating that feeding and the presence of 
a fearless cage-mate, respectively, produced 
greater fear reduction than controls confined 
for the same time in the presence of the CS. 
However, in both these studies, the greater 
movement of the experimental animals might 
have produced greater reduction in fear solely 
in terms of increased unreinforced exposure to 
conditioned fear cues in the apparatus. 
From all these earlier studies Lomont con- 
cluded that “reciprocal inhibition” fails to 
receive appreciable support as a mode of 
anxiety reduction superior to mere repeated 
exposure to the CS. However, a later study 
by Gale, Sturmfels, and Gale (1966), he 
asserted, did provide such evidence. Using a 
classical conditioning (conditioned emotional 
response—CER) rather than a CAR para- 
digm, these workers found that providing 
"food as a stimulus for a response incom- 
patible with fear" reduced fear more quickly 
than mere exposure to the same graded CS 
series. The CER paradigm avoided the prob- 
lems of increased exposure due to greater 
movement in one group versus the other, but 
the use of defecation as the dependent vari- 
able when food is given to one group and not 
to the other raises problems of interpretation. 
This could be handled by a control in which 
food is provided unpaired with anxiety- 
eliciting stimuli. Of more fundamental impor- 
tance perhaps is the fact that they did not 
include a regular extinction control 


| group in 
which the CS was presented at is. 


original 
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intensity. Without this group it is impossible 
to conclude that “extinction” is less effective 
than “reciprocal inhibition.” Indeed, Poppen 
(1968), as discussed below, has shown that a 
hierarchy does not produce quicker extinction 
than regular CS presentation within the CER 
paradigm. Since Gale et al. obtained no sig- 
nificant differences between their experimental 
groups on posttreatment fest trials, we cannot 
conclude that either of their groups is more 
effective than regular extinction. In con- 
trast to Lomonts advised caution, Rach- 
man’s (1967) statement that this particular 
study “clearly indicates the superiority of 


the inhibition technique [p. 98]" seems 
unjustified. 
Recent experiments by Nelson (1966, 


1967) contain controls that shed new light on 
the possible role of stimuli that supposedly 


elicit anxiety-competing responses. In the first , 
study (Nelson, 1966) rats were conditioned to 


a CS, following Lane (1954), and then were 
confined in the fear compartment of an ap- 
paratus like that introduced by Miller (1948), 
with or without food (when hungry), and 
were finally tested for degree of residual fear. 
Escape from the fear compartment was pre- 
vented during conditioning and allowed only 


during the last sequence. Level of fear was ^ 


thereby inferred by evaluating its strength 
as a drive for learning a new response. This 
represented an improvement on Lane's design, 
where animals were given shock-escape train- 
ing in the first phase of the experiment, the 
dependent measure being latency of subse- 
quent escapes from the fear compartment. 
Lane had found that rats confined without 
iood crouched at the escape door while the 
animals confined with food present move 
around the compartment. Nelson placed the 
rats directly over the food dish, thereby mini- 
mizing the conditioning of approach response? 
to food that may have confounded the ™* 
surement of fear, as in Lane's experimen’ 
Results indicated that confinement Pe 5^ 
with or without food, reduced fear signifi- 
cantly more than no confinement. Compari- 
sons of food versus confinement groups 
showed the former to be more fearful. In on 
trast to Lane’s findings, Nelson reported tha 
feeding resulted in Jess exposure to the fear 
cues in the shock compartment, since bis pro- 
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cedure of placing hungry animals directly 


, over the food dish resulted in keeping them 


Av 


—_ e 


in the area of the food dish rather than 
roaming around the compartment. 

In a second experiment, Nelson (1966) 
added the following interpolated treatments 
in order to isolate better the relationship be- 
tween exposure to the CS and the presence 
of food in the shock box: (a) Incentive: food 
Present in shock compartment but freedom to 
Move from one side to the other; (b) no 
incentive: same freedom but without food; 
(c) incentive control: confinement in the 
shock box to match the time spent by a yoked 
Tat in the incentive group, except with no food 
Present; and (d) no incentive control: simi- 
larly matched with the no incentive group, 
With no food present. Results reveal that the 
Presence of food facilitated fear reduction by 


„ Serving as an incentive for exposure. 


In the same vein, Nelson (1967) ran a 


Study employing chlorpromazine as a means to 
Compare sheer exposure with exposure facili- 
tated by competing responses. Rats were con- 
ditioned undrugged, given exposure to the 
€ar compartment with or without the drug, 
and then tested undrugged for residual fear. 

€ found that drugged rats were less fearful 
than undrugged animals only when the drug 
resulted in increased exposure to the fear 
Compartment. 

Taken together, Nelson’s three studies indi- 
Cate strongly that the provision of a stimulus 
hat supposedly elicits an anxiety-competing 
response (food, chlorpromazine) facilitates 
extinction of fear through the increased expo- 


"Sure to the CS which it can bring about. 


=x 


When these additional stimulus conditions are 
arranged so as not to provide such increased 
Xposure, as in his first study, they result in 
Poorer reduction of fear. Thus, by focusing 
less on the competing responses and more on 
the stimulus conditions that supposedly effect 
these responses, the importance of feeding (as 
Stressed by Wolpe) is seen in a different light. 
Indeeq, Wolpe (1958) himself reported that 
anxiety in his cats was more rapidly elimi- 
nated when, in feeding them in the shock 


compartment, “the pellets were tossed at 


distributed points than if they were 
nfined to the food box [p. 5712 


Wilson and Dinsmoor (1970) have con- 
cluded that feeding per se can reduce fear. 
They established passive avoidance behavior 
in rats by shocking them for leaving a small 
platform at the end of a grid runway. During 
extinction treatment trials, one group was fed 
on the platform, a regular extinction. group 
was simply placed on the platform, and a 
third group was forcibly exposed to the grid. 
On test trials both the fed and forced expo- 
sure groups showed significantly greater ex- 
tinction than the regular extinction group, 
but apparently did not differ from each other. 
Wilson and Dinsmoor reported that feeding 
the rats tended to decrease rather than in- 
crease exposure to [fear stimuli, and they 
pointed out that neither the reinforcement of 
incompatible motor responses nor freezing can 
explain the results, since these activities would 
have kept the rats on the platform and so 
delayed rather than facilitated extinction in 
the passive avoidance paradigm. 

A dissertation by Heath (1969) purports 
to furnish evidence in favor of the overriding 
importance of sheer exposure as compared to 
“reciprocal inhibition,” but several methodo- 
logical problems make his conclusions doubt- 
ful. In addition to a “disuse” control group, 
he compared the efficacy of two special treat- 
ment procedures in the extinction of avoid- 
ance behavior in a Mowrer-Miller shuttle box, 
tonal CS duration during acquisition being 10 
seconds: “Reciprocal inhibition” rats were 
provided food during five 100-second expo- 
sures to a preprogrammed five-step tonal hier- 
archy, with no escape possible during the 
extended presentations of the generalized CS, 
and “flooding” rats had five 100-second expo- 
sures to the CS at training intensity without 
food available. Since both these groups were 
superior to the no-treatment (“disuse”) group 
and not different from each other, Heath con- 
cluded that exposure alone to the CS was 
the critical factor, with “reciprocal inhibition” 
playing no role. The problems we see are 
(a) the presence of the food tray, from which 
reciprocal inhibition animals could eat, intro- 
duced a difference 1n stimulus conditions 
during treatment as compared to the flooding 
animals; (2) reciprocal inhibition subjects 
had extended and forced exposure to the tonal 
CS, making this an inappropriate analogue of 
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Wolpe's procedure; and (c) the absence of 
a regular extinction condition (repeated expo- 
sure to the 10-second CS at training inten- 
sity) precludes any judgment on the relative 
efficacy of the special treatment procedures. 
There also are serious questions as to 
whether the reciprocal inhibition procedures 
were such as to allow feeding to take place 
at all during treatment; this more general 
problem of pairing is discussed below. 
Gambrill (1967) employed a rather com- 
plex design in an attempt to compare the 
relative effectiveness of reinforcing a re- 
sponse incompatible with a previously ac- 
quired avoidance response on the elimination 
of the latter. After rats were conditioned to 
reliably avoid a shock on a Sidman schedule 
by turning a wheel, certain groups were given 
training in lifting a lever for food. When 
compared with a traditional extinction group, 
superior extinction of one of the “ 
conditioning” groups was manifest only on the 
first test session. Afterward, for the remain- 
ing nine extinction test sessions, no differences 
were observed among the various groups. Her 
conclusion was that providing an instrumental 
appetitive response incompatible with the 
avoidance response was not superior to only 
nonreinforcement in reducing avoidance. 
Counterconditioning and the problem of 


pairing. Gambrill's discussion of these results 
leads to some co 


to make at this 
of the designs i 
to test the effe 
find all these 
operationalize 


counter- 


hey 
in that 


, nxiety/ 
avoidance, One must keep in mind the stan- 
dard desensitization procedure, in which the 
individual 


is not only presented with a graded 
anxiety-provoking stimuli (as, 
t of the above studies do not do), 
€ time given the opportunit 
perform an inepnpetible mera but de 
importantly, is enabled—nay, instructed —to 
Temove himself from the aversive situation 
as soon as the anxiety occurs. that is, as 
Soon as the response Considered antagonistic 
to that to be counterconditioned Sikes to be 


series of 
indeed, mos 
at the sam 
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dominant. It is only in this way that one | 
ensures that, as much as possible, the orga- 
nism is confronting the original aversive situ- 
ation (initially a watered-down form of it) 
without behaving fearfully. Thus, careful pro- 
gramming of the aversive stimuli is necessary 
in order to minimize the occasions on which 
the organism is performing in the presence of 
the CS that response which is the target for 
removal; providing only for graded presenta- 
tion of the CS is probably necessary, but 
rarely sufficient as well to ensure true pairing. 
Of course, if one is concerned only with pair- 
ing at the onset of the CS, then studies such 
as those reviewed above probably satisfy the 
criterion; that is, the animal eats as the CS 
comes on. However, as often happens in de- 
sensitization, the CS at some point disrupts 
the competing activity. It is this disruption 
that we are talking about; to the degree that . 
the anxiety-competing response does not per- 
sist throughout CS presentation we do not 
see pairing taking place. We should point 
out also that our conceptualization of counter- 
conditioning is more delimiting than Kimble’s 
usage, namely, “Extinction under circum- 
stances in which the response decrement is 
hastened by reinforcement of a response which 
displaces the original conditioned response 
[Kimble, 1961, p. 478]." 


Graded Exposure to Aversive Stimuli Paired 


and Unpaired with Anxiety-Competing 
Responses 


Systematic desensitization involves not only 
a pairing of competing responses with avers A 
stimuli but also the use of a graded hierarchy 
according to which these stimuli are DIC 
sented to the person. The argument has bo 
that the use of such a hierarchy makes © 
more likely that anxiety-competing relaxati? 
Will predominate over the anxiety ordinary 
elicited by the particular generalized aver?! te 
stimulus. We have seen from the section JUS 
above that considerable reduction of anxiety 4 
avoidance is possible when animals are >a 
posed repeatedly to the full-intensity CS, an 
that, furthermore, this reduction is usually 
not inferior to that brought about when EU 
animal is perhaps enabled io perform an -. 
Compatible response during CS presentations: 
The only studies thus far covered tha 
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employed any kind of graduated presentations 
were those by Gale et al. (1966) and Heath 
(1969), though the problems already pointed 
out make it impossible to evaluate the neces- 
sity or desirability for graded presentations. 
Several studies have, however, been executed 
With animals which, in an analogue fashion, 
would seem to have bearing on the use of an 
anxiety hierarchy in desensitization in con- 
Junction with anxiety-competing responses. 

In the review already cited, Lomont (1965) 
concluded that the bulk of experimental data 
with rats did not indicate that progressive 
exposure to the CS results in faster extinction 
than exposure to it at training intensity. 
Most of the studies discussed were within the 
Murray-Berkun (1955) displacement model, 
in terms of which, following the establishment 
of an approach-avoidance conflict, rats which 
have been allowed to make goal responses in 
'displaced" alleys (alleys dissimilar to the 
Original one in width and color) eventually 
make approach responses in the original alley. 
While confirming this, Berkun (1957) did 
Show that when total number of trials were 
Considered, rats given displacement training 
required slightly more trials to reach a cri- 
terion of recovery than did control animals 
given trials only in the original alley. This 
negative finding was replicated by Noblin 
and Maher (1962) and Taylor and Rennie 
(1961). On the other hand, Taylor and 
Maher (1959) did find an economic advan- 
tage from the displacement trials. In the 
absence of any definitive account of the 
Complex variables involved in displacement 
responding, however, Noblin and Maher's 
(1962) caution in interpreting these several 
Studies seems appropriate. At any rate, these 
displacement studies are an inexact analogy 
to desensitization, in which gradual progres- 
sion involves a typically lengthy hierarchy 
of situations, not just one or two slightly 
differing ones, 

A more direct test of the importance of 
gradual stimulus presentation was attempted 
in Kimble and Kendall’s (1953) comparison 
of Guthrie’s (1935) “toleration” method of 
Producing extinction with the conventional 
method, which they term “exhaustion.” They 
found that the former was significantly more 
effective, and concluded therefrom that overt 


nonreinforced avoidance responses were not as 
crucially necessary for extinction as Hull’s in- 
hibitory postulates had held. Weiss and Mona- 
han (1969) criticized this widely cited study, 
pointing out that Kimble and Kendall’s toler- 
ation rats had considerably more exposure 
to the CS or variations thereof than did 
exhaustion animals. This disparity makes it 
impossible to conclude anything about 
progressive exposure to the CS. Additionally, 
one can pose questions about the nature of 
their “toleration” condition. Subjects were 
given seven “toleration trials" in eack of 
which the CS intensity was gradually raised 
from some initially lower-than-training level 
to the full training level. It is important to 
note that this procedure differs from one in 
which successive levels of the original CS 
are presented as discrete trials, and where 
progress to a higher level of CS intensity is 
contingent on the cessation of responses at 
the immediately preceding level. Such a 
response-contingent hierarchy we define as 
toleration, and we distinguish it from the 
presentation of a preprogrammed series of 
graded stimuli where progress up the hier- 
archy is noncontingent on the absence of an 
avoidance response (cf. Gale et al, 1966; 
Heath, 1969). 

In a tightly controlled experiment designed 
to overcome the methodological problems 
of the Kimble-Kendall study, Weiss and 
Monahan (1969) matched rats with respect 
to a stringent criterion of avoidance profi- 
ciency. Toleration subjects, during the 
treatment? phase, progressed through a hier- 
archy of the visual training CS, beginning at 
25 volts and increasing by 5 volts on each 
trial, provided no avoidance response was 
made (thus fulfilling the criterion we have 
suggested above vis-à-vis analogue studies of 
the clinical desensitization procedure). Ex- 
haustion subjects were yoked to toleration 
subjects, in the same fashion employed by 
Davison (1968), that is, number and dura- 
tion of exposures to full-intensity CS being 
determined by the behavior of the toleration 
“partner.” Results showed a striking superior- 
ity of the toleration group in extinguishing 
the avoidance response. : 

In addition to differences in the dependent 
measure—number of avoidance responses on 
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test extinction trials—toleration subjects also 
emitted fewer avoidance responses during 
their treatment, in fact, about one-fourth of 
those emitted by their yoked exhaustion 
mates. In order to achieve precise control 
over sheer exposure to the CS and its varia- 
tions (the problem noted with the Kimble- 
Kendall study), the warning light remained 
on during all treatment trials for both groups, 
in spite of whether an avoidance response 
occurred. The success of the toleration pro- 
cedure probably lay in significantly reducing 
the number of avoidance responses during 
treatment. This served both to expose those 
animals to more unreinforced presentations 
of the training CS (albeit via its generaliza- 
tion gradient), which is known to facilitate 
extinction (Kalish, 1954), and to reduce 
response-maintaining “safety signals” associ- 
ated with the occurrence of an avoidance 
response (Bolles & Grossen, 1969). It should 
be noted that Weiss and Monahan employed 
& nonconventional procedure in which the 
occurrence of an avoidance response did not 
terminate the CS, which remained on through- 
out each trial. This and other methods of 
forced exposure to the CS are discussed below. 
Thus far there have been only two studies 
attempting, within a single design, to com- 
pare the efficacy of graded versus ungraded 
exposure to the CS paired and unpaired with 
anxiety-inhibiting stimuli. Goldstein (1969) 
attempted to evaluate the advantages of pro- 
viding an incompatible response in addition 
to arranging for progressive exposure to a 
feared stimulus in an approach-avoidance 
situation. While he concluded that the 
anxiety-competing response (eating) facili- 
tated anxiety reduction, a careful examina- 
tion of his statistical analysis fails to justify 
the conclusion: not only did an analysis of 
Variance fail to yield a significant F ratio, 
but none of the treatment groups did, in fact, 

differ Significantly from each other, 
„Poppen (1968) chose the classical condi- 
tioning paradigm of conditioned suppression 
as an analogue to systematic desensitization, 
rejecting avoidance conditioning on the 
grounds that it is a Problematical measure 
f fear (see below). After teaching rats to 
a | pend reliably for food and then to 
uch pressing during a tonal CS, he 


assigned animals to one of several conditions, 
carefully matching for acquisition of the 
conditioned suppression: (a) “Regular extinc- 
tion," whereby the CS was always presented 
at original intensity; (^) "inhibition," in 
which, during the CS period, food was pro- 
vided gratis (this presumably being, then, 
ungraded exposure paired with food); (c) 
“toleration,” in which subjects progressed 
through five generalized tones before reaching 
training frequency, progress being contingent 
on a criterion of extinction at each tonal fre- 
quency (similar, therefore, to the toleration 
group of Weiss and Monahan, 1969); and 
(d) "toleration plus inhibition," in which 
free pellets were made available during 
the same kind of graded presentation as 
in the toleration group (hence an analogue 
to desensitization). 

Two principal dependent easures were 
used, and the results differ occasionally de- 
pending on which measure is examined. In 
terms of mean number of trials necessary to 
reach criterion of extinction, all groups but 
Toleration + Inhibition did not differ signifi- 
cantly, the latter alone being significantly 
superior to all the others. The striking superi- 
ority of this analogue-to-desensitization group 
seemed to account for the significant effects 
associated with the two factors, type of CS 
(graded versus full) and presence or absence 
of food. Providing a hierarchy did not prove 
superior to sheer repeated presentations of 
the original CS, and providing food paired 
with the CS expedited extinction only when 
the CS was presented gradually; associating 
a presumably anxiety-competing response with 
repeated presentations of the CS at training 
intensity did not expedite extinction over and 
above sheer repeated exposure, results con- 
sonant with Moltz (1954) and Nelson (1966). 

Whatever the merit of Poppen’s reasoning 
about an adequate measure of fear, howeve! 
a classical conditioning paradigm, unlike the 
avoidance situation, does not meet a funda- 
mental requirement of systematic desensitiza- 
tion, namely, that exposure to the aversive 
stimulus be under the control of the subject 
and not the experimenter, Although the rats 
were said to have eaten most of their pellets 
during treatment sessions, which suggests that 
fear was not such as would have led to much 
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escaping from the CS, the procedural discrep- 
ancy with systematic desensitization must be 
considered. 

Additional findings can be mentioned. 
During each “treatment” session required to 
achieve the criterion of extinction, groups did 
differ with respect to the amount of response 
Suppression (“anxiety”) that occurred. It is 
logically possible for two rats to take the 
same number of sessions in order to extin- 
guish and yet to “suffer” differentially while 
doing so. Poppen employed a “suppression 
ratio” in order to make this comparison, and, 
when comparing the groups on this measure, 
some interesting differences did emerge. An 
analysis of variance once again showed effects 
for mode of CS presentation (graded versus 
full) and availability of an antagonistic re- 
sponse (food versus no food), but now many 
between-group differences emerged. Once 
again Toleration + Inhibition surpassed the 
Other three groups in terms of extinction, or 
rather ease of extinction. Regular extinction 
Was shown to be significantly inferior to all 
other groups, which means that either pro- 
viding food (inhibition) or a graded hierarchy 
(toleration) reduces the amount of fear during 
treatment. 

Thus, one can say that considerable support 
is found for the superiority of graded expo- 
Sure being paired with anxiety-competing re- 
Sponses as compared to graded exposure alone, 
and some support is given for the superiority 
Of graded exposure over repeated exposure 
to the CS at training intensity. Moreover, 
With regard to the issue of repeated exposure 
to the CS paired or unpaired with anxiety- 
antagonistic responses, some support is given 
for the preference of the former over the 


latter, 
it is clear that all too few 


In summary, 
methodologically sound studies have been exe- 
f a progressive 


cuted to test the efficacy 0 

hierarchy, with or without accompanying 
anxiety-competing responses. From the above, 
it seems wise to conclude tentatively that 
gradual exposure to the CS might be prefer- 
able to the same amount of exposure to the 
Original CS, at least when the avoidance re- 
SPonse is not prevented (Weiss & Monahan, 
1969), particularly if one is concerned that 
the process of treatment be as unstressful as 


possible (Poppen, 1968; Weiss & Monahan, 
1969). The only experiment adequately de- 
signed to assess the gains from adding an 
anxiety-competing response to the graded 
exposure (Poppen) confirms the desirability 
of providing such an anxiety-inhibiting re- 
sponse, with respect both to ease and efficiency 
of extinction treatment. 


* Resensitization": The Issue of Forced 
(Prolonged) Exposure 


A frequently stressed procedural prescrip- 
tion in systematic desensitization is that as 
little anxiety as possible be elicited in the 
subject/client. This goal is pursued through 
(a) the imposition of anxiety-competing re- 
laxation, (5) the use of a carefully calibrated 
anxiety-hierarchy, and (c) the provision of 
escaping from an aversive scene as soon as 
it evokes even the slightest degree of per- 
ceived discomfort. According to Wolpe, not 
only will the autonomic effects of relaxation 
be able to “reciprocally inhibit" only rela- 
tively weak anxiety responses, but “Exposure, 
and prolonged exposure in particular, to a 
very disturbing scene can seriously increase 
phobic sensitivity [Wolpe, 1969, p. 127].” We 
have already seen, however, that this integral 
part of the clinical lore in behavior therapy 
is contradicted by the well-established phe- 
nomenon of experimental extinction. And as 
we have stated elsewhere (Davison, 1968), 
the experimental literature lacks not only 
data showing acquisition or reinforcement of 
stable conditioned fear responses in humans 
via pairing with supposedly secondary rein- 
forcers( like an anxiety image), but is notably 
as well in demonstrations of stable 

humans through rein- 
ul electric shock as the 


lacking 
conditioned fear in 
forcement via painf 
unconditioned stimulus (UCS)? 


as often cited a study by Campbell, 
Laverty (1964), in which alcoholics 
during 2 minutes of muscular pa- 
an injection of Scoline, which 
pairing resulted in an apparently conditioned fear 
reaction extremely resistant to extinction ; rather than 
viewing this experiment as demonstrating a typical 
aspect of human neurotic behavior, we see it as a 
bizarre demonstration that bears only the most 
emote similarity to what is found in civilian life, 
de far removed from the kind of forced 


rtainl 5 $ 
E to a distressing image that Wolpe cautions 
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On the other hand, there are indeed scat- 
tered research findings, intriguing and para- 
doxical, that may lend some plausibility to 
Wolpe's clinical assertion regarding resensi- 
tization, and these should be mentioned as 
well. Eysenck (1968) has cited an experiment 
by Napolkov in which unreinforced presenta- 
tion of the CS after only one CS-UCS pairing 
produced increasing levels of blood pressure. 
In fact, the conditioned response (CR) rap- 
idly grew greater than the original uncondi- 
tioned response (UCR)! More recently, 
Rohrbaugh and Riccio (1970) have demon- 
strated what they term the “paradoxical 
enhancement” of fear in rats. In one experi- 
ment rats were exposed to unreinforced condi- 
tioned fear stimuli for 0, 3, 5, 15, or 50 
minutes. Fear was subsequently assessed in 
terms of suppression of approach to food or 
water with the CS present. While the 15- 
and 50-minute exposures resulted in some 
extinction (the usual finding), the 5-minute 
group showed more residual fear than non- 
exposed controls. In a second experiment, 30- 
and 60-second interpolated exposures to the 
CS increased sensitivity as compared to 300- 
second exposure subjects and nonexposed 
controls. The investigators concluded that 
paradoxical enhancement is likely to be an 
exceptional rather than a typical outcome 
of unreinforced exposure to the CS, though the 
necessary and sufficient conditions for its 
occurrence must surely be delineated through 
careful parametric studies. : 

Forced exposure to an aversive CS. Most 
of the avoidance studies discussed in this 
Paper are conceptualized in terms of two- 
factor theory of avoidance learning (Rescorla 
& Solomon, 1967). This view emphasizes the 
classical extinction of fear, which is said to 
motivate the instrumental response. This is 


optimally achieved by nonreinforced, inescap- 


against. Moreover, a sul 
same research 
Sanderson, 


e wing true conditioning (CS paired 
with UCS) as compared with pseudo-conditioning 
(UCS alone). In addition, a more recent paper by 
h ; and Martin (1968) reported that 
ollowing Such treatment with 12 alcoholics, the 9 
patients located at a l-year follow-up showed little 
effect of the traumatic conditioning treatment. 


able exposure to the CS which prevents any 
short-latency avoidance response that might 
conserve anxiety by removing the subject 
from the CS before the fear CR develops 
(Solomon & Wynne, 1954). 

Baum (1970) has systematically explored 
the effect of preventing an avoidance response 
on extinction. He has found that the duration 
of the response prevention used is a critical 
factor influencing the results achieved with 
this method. In general, the greater the 
duration of forced exposure to the CS, the 
greater the extinction; too short an exposure 
has little effect on extinction. The more re- 
sistant to extinction the response is (viz., the 
greater the “anxiety” is as a result of a 
high intensity US or increased amounts of 
shock exposure), the longer is the period of 
response prevention necessary to eliminate the 
avoidance response. The extinction-producing 
effect of response prevention can be signifi- 
cantly facilitated by either introducing non- 
fearful rats into the apparatus (such as Hall, 
1955, did) or forcing the rat to move around 
the experimental compartment. Systematic ob- 
servation of the rats’ behavior during response 
prevention and the success of "social and 
mechanical facilitation" strategies suggest 
that eliciting nonfearful behavior during expo- 
sure to the aversive CS is an important vari- 
able in producing extinction of an avoidance 
response. 

Page (1955) had earlier reported that re- 
sponse prevention or blocking facilitated ex- 
tinction of the avoidance response not because 
fear had been reduced, but because confine- 
ment resulted in subjects learning a “freezing 
response which was incompatible with the 
measurement of avoidance. An independent 
measure of approaching the locus of shock for 
food revealed a residual fear in subjects 
whose responses had been blocked. Coulter, 
Riccio, and Page (1969) have replicated these 
results and found greater residual fear in rats 
undergoing response prevention than in rats 
extinguished in the regular manner. Kamm, 
Brimer, and Black (1963) found that 4 CS, 
to which the avoidance response had been 
extinguished, retained aversive properties for 
Subjects as demonstrated jn a conditioned 
Suppression paradigm. Evidence of this nature 
has led Poppen (1968) and Bandura (1969) 
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to reject the extinction of avoidance condi- 
tioning by means of blocking as an appropri- 
ate analogue of fear reduction in systematic 
desensitization. However, Nelson (1969) has 
shown that blocking can have a significant 
fear-reducing effect. In contrast to his and 
other studies cited above, Nelson conditioned 
fear, not by escape training, but by punishing 
entry into the shock compartment of a 
Mowrer-Miller box. As in the Coulter et al. 
(1969) and Page (1955) studies, fear was 
assessed by entry into the shock compartment 
for food, a procedure which did not con- 
found freezing with fear extinction. Similarly, 
using a passive avoidance paradigm (as dis- 
cussed above), which rules out an explanation 
in terms of the reinforcement of incompatible 
motor responses or freezing, Wilson and 
Dinsmoor (1970) found that blocking pro- 
duced more rapid extinction than the regular 
extinction method. 

_ It is important to note that another major 
line of theorizing and research which eschews 
two-factor theory in explaining avoidance 
learning also prescribes response prevention 
as an optimal extinction procedure. The 
discrimination theory holds that extinction is 
expedited to the degree that the subject can 
distinguish the change in reinforcement con- 


Ungencies between acquisition training and 
extinction (Bolles & Grossen, 1969; D'Amato, 
1969). Blocking eliminates 


1970; Herrnstein, ockir 
the response-contingent termination of the CS 
f the regular extinc- 


Which is characteristic o ! 
tion procedure, and which precludes à dis- 
crimination of altered reinforcement contin- 
8encies. This critical discrimination is also 
acilitated by “flooding,” or the procedure in 
which the CS remains on for an extended time, 
irrespective of the subject’s response, and 
Which has been shown to be superior to regu- 
lar extinction (Katzev, 1967; Polin, 1959; 
Shearman, 1968). . 
. The procedural implications for desensitiza- 
tion are clear: Wolpe prescribes that the 
Client/subject be able to escape from a scene 
as soon as it becomes aversive, and if one 
Construes the anxiety signal as an avoidance 
response, then desensitization is unnecessarily 
inefficient, Tt is interesting to note that 
Pe (1965), in arguing for the merits of 
€ "extinction? interpretation of systematic 
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desensitization, advanced the “fact” that the 
operations of systematic desensitization were 
optimal for producing extinction insofar as 
the client is not allowed to escape from the 
aversive stimuli. In fact, it is the client, and 
not the therapist, who determines the length 
of exposure to the aversive scene in systematic 
desensitization.* 


CoNCLUSION 


The foregoing review should make it clear 
that easy generalizations about the reduction 
of human fear with limited reference to ani- 
mal studies are misleading. The complex, in- 
complete, and often inconsistent nature of the 
extinction literature that Kimble (1961) 
noted is still evident, especially that area 
dealing with the elimination of fear /avoid- 
ance behavior. Nevertheless, some general 
conclusions can be drawn, The countercondi- 
tioning procedure has been shown to be more 
effective and less disruptive in reducing 
classically conditioned fear than the orthodox 
extinction procedures using conditioned sup- 
pression and passive avoidance paradigms 
(Poppen, 1968; Wilson & Dinsmoor, 1970). 
With avoidance behavior the provision of 
counterconditioning agents like food and 
chlorpromazine probably expedites extinction 
by facilitating exposure to the CS which would 
otherwise be avoided (Nelson, 1967). Weiss 
and Monahan (1969) have demonstrated 
how the toleration procedure achieves the 


same end. 
Extrapolating from these laboratory find- 


ings to the clinical setting, the provision of 
anxiety-competing relaxation with graded 


4The problems with analogues are well-illustrated 
by the issue of the anxiety signal as an avoidance 
response. At first blush, it seems rather straight- 
forward to assume that when an individual under- 
going desensitization signals anxiety and thereby. is 
removed from the aversive image, he is performing 
a response functionally identical to, for example, a 
wheel-turning response by ie in a ee pue 
a . However, it is customary for a 
venient know beforehand what the pro- 
cedure of desensitization entails, namely, graduated 
nd repeated exposure to imaginal aversive stimuli. 
Th s, when the human subject “avoids” (or better, 
4 $ed X) is this really the same as the wheel turn? 
Sree at for the human subject knows that 


ji ubt it 
ede of this anxiety signal, he will have 40: CODE 
front the same stimulus again. 
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exposure to aversive stimuli may be necessary 
with clients for reasons other than those origi- 
nally held. A conceptualization of desensi- 
tization in terms of the extinction rather than 
counterconditioning paradigm is not incon- 
sistent with the necessity for relaxation. For it 
may be that rather than providing for 
“reciprocal inhibition,” since anxiety generates 
too little “reactive inhibition” to form the 
basis of “conditioned inhibition,” relaxation 
functions as did food and chlorpromazine in 
Nelson’s studies (1966, 1967), namely, as a 
facilitator for exposure. It must be remem- 
bered that phobics are, by definition, people 
who typically avoid their respective feared 
objects. In order to encourage them to expose 
themselves to what they fear, it may be neces- 
sary not only to create a situation that is 
graduated, beginning with relatively mild 
“doses” of the phobia, but also to reduce 
still further the distress to be elicited by 
stimuli far along the generalization gradient 
by providing the person with a comforting 
antianxiety response, like relaxation. Since 
progress up to the hierarchy is traditionally 
controlled by the client, he should be more 
likely to confront aspects of his fear if he is 
bolstered in some way, for example, by 
knowing how to relax. Morever, a recent 
experiment by Geer, Davison, and Gatchel 
(1970) with human subjects shows that elec- 
tric shocks cause significantly less distress if 
subjects only believe they have control over 


their duration, even when that control is 
illusory. 


The critical role of sheer 
CS is highlighted by studies of avoidance 
behavior which demonstrate that response 
prevention is an effective and efficient method 
of extinction. These studies have led to thera- 
peutic strategies relying on extended forced 
exposure to the CS (Stampfl, 1966). These 
techniques construe the client's anxiety signal 
in desensitization as an avoidance response 


and are directed toward preventing client- 


controlled termination of the aversive image 
in the belief that such escaping will simply 
serve to maintain the 


A fear. 

. Systematic desensitization has been pred- 
icated by Wolpe and his colleagues on the 
allegedly Sound experimenta] basis estab- 
lished in the learning laboratory, This review 


exposure to the 
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provides support for some of the procedural | 
prescriptions of desensitization at the same 
time that it raises serious questions about the 
conditioning explanations put forth by Wolpe. 
Particular issue is taken with the concept of 
“reciprocal inhibition," not the least of which 
is that it represents a shaky neurologization 
of a behavioral system (Hull) which has al- 
ready been shown to have serious logical and 
empirical shortcomings. In addition, experi- 
mental findings point to possibly new thera- 
peutic techniques other than those which 
Wolpe originally allowed. We would now 
anticipate, in addition to the sorely needed 
animal research, systematic investigation into 
the processes of the technique itself, intro- 
ducing modifications that are truly: derived 
irom more basic research. Findings from such 
research efforts will help pin down the ana- 
logue results and contribute both. to the im- 
provement of the (changed) technique and 
to the clarification of its process-mechanism 
underpinnings. 
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Present conceptualizations of stereotypes in psychology are outlined, and 
theoretical approaches linking ethnic stereotypes to negative attitudes and to 
discriminatory behavior are discussed. Research results concerning the content 


of ethnic stereotypes, 
reciprocal nature of 


their development and resistance to change, and the 
intergroup stereotypes are reviewed. Several critical issues, 


such as methodological problems and the “kernel of truth” controversy, are also 
discussed. It is suggested that none of the current definitions of stereotype 


are completely adequate. I 
conceptual and empirical, 
regarded as generalizations th 


who aífixes the label. There are several 


eralization may be considered unju 
approach are presented. 


. One of the terms most frequently employed 
i Tesearch and theorizing concerning ethnic 
attitudes, prejudice, and intergroup percep- 
lion and conflict is stereotype. The importance 
of this concept and the intellectual processes 
Presumed to underlie it have been emphasized 
by a plethora of research concerned with eth- 
hic attitudes and intergroup relations. 

Yet, almost 50 years after the term stereo- 
lype was coined, there is little agreement be- 
tween researchers and theorists concerning the 
it unction and importance of ethnic stereotypes 
IN social perception and behavior. At a more 
basic level, there is little consensus on just 
What a stereotype is. Although much, perhaps 
Most, “stereotype research” has proceeded 
With scant attention paid to such ambiguities, 
> Seems to this writer that the accumulated 
Tésearch products will necessarily remain 
fragmented and of limited value until these 
vonceptual questions are resolved, or at least 
ecognized. 

The conceptual work 
d the methodology developed by Ka 

raly (1933) have served as models for most 
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of Lippmann (1922) 
tz and 
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t is proposed that much of the ambiguity, both 
in this area may be reduced if stereotypes are 
at are considered as unjustified by the person 


iypes of criteria under which a gen- 


istified. Possible research implications of this 


of the subsequent theorizing and research in 
this area. Lippmann's original definition of 
stereotypes as being factually incorrect, pro- 
duced through illogical reasoning, and rigid 
was sufficiently broad that most subsequent 
researchers have simply identified ove of the 
above characteristics as the hallmark of an 
ethnic stereotype. This emphasis on differing 
criteria of stereotypes described by Lippmann, 
and the implications that accompany each of 
them, have led to considerable variance in the 
current conceptualizations of stereotype, as is 
pointed out in upcoming portions of this 
paper. 

Given this situation, one might expect that 
there would have been several distinct paths 
of stereotype research in psychology, each 
attempting to gather data on one of the char- 
acteristics of stereotypes described by Lipp- 
mann. However, most researchers, regardless 
of their theoretical orientation, have utilized 
the Katz and Braly (1933) research para- 
digm. It is suggested later in this paper that 
the data obtained from this paradigm have 
little direct bearing on 0y of the conceptual 
criteria presented above. Tt would seem ap- 
propriate, therefore, to take a fresh look at 
the concept of ethnic stereotype and the rele- 
vant data that have been gathered in the 
past 40 years. -— . "P 

The results of empirical investigations of 
stereotypes and stereotypers are therefore re- 
viewed. Since the most commonly investigated 
object of stereotypes has been Negroes, stereo- 
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types of Negroes are used as a continuing 
point of reference in this review. In addition, 
particular attention is paid to several issues 
that are of considerable current interest, 
such as whether ethnic Stereotypes must con- 
tain at least a “kernel of truth” in order to 
endure over time, and theoretical and empiri- 
cal work dealing with the relationship of eth- 
nic stereotypes to prejudicial attitudes and 
discriminatory behavior. After such an over- 
view of the area has been obtained, it would 
seem appropriate to attempt to “pick up the 
pieces” and clarify the meanings, implicit and 
explicit, of ethnic stereotypes. In brief, such a 
clarification points to the necessity for recog- 
nizing that the values and standards of the 
researcher or theoretician are inseparably in- 
volved in his classification of certain beliefs or 
generalizations as stereotypes. While such a 
recognition does not mean that the concept is 
stripped of its utility, it does imply that the 
standards of the researcher be made more 
explicit. Such an enumeration of these stan- 
dards, it is Suggested, can lead to potentially 
productive paths of meaningful research. In 
line with this suggestion, several research 


directions which seem to hold promise are 
described. 


DEVELOPMENT OF THEORY AND 
METHODOLOGY 
Origin of the Term 


Credit for introducin 
concept of stereot: 
mann. In his 1922 
mann was concern 
ment" which is in 


g and delineating the 
ype goes to Walter Lipp- 
book Public O pinion, Lipp- 
ed with the “quasi-environ- 
serted between man and his 
teal environment. In order to deal with a 
world which is too complex for direct acquisi- 
tion, man constructs a “picture inside his 


head” of the world beyond his reach, The 
form of this “picture” 


culturally deter 
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type of short-circuiting that “imposes a ce 
tain character on the data of our senses bi 
fore the data reach the intelligence [p. 98]. 
Lippmann reserved his strongest criticism f 
the content of the stereotype, however. Hi 
spoke of the “strange connection” betwe 
the stereotype and the facts, and the necessit 
that the stereotype and the facts must “al 
ways” part company. Finally, Lippmann e : 
phasized the rigidity of stereotypes and thei 
resistance to education. 


The Katz and Braly Paradigm 


The most frequently cited study of ethni 
stereotypes remains that carried out 4 
Princeton in 1932 by Katz and Braly (1933) 
Considerable attention is paid to this investi 
gation since, along with being the first em 
pirical study of ethnic stereotypes,? the para 
digm created by these researchers has serve 
as a direct model for the majority of subse 
quent stereotype research. 

One hundred Princeton students were aske 
to list the traits that they considered mos 
characteristic of 10 ethnic groups: Negroes 
Germans, Jews, Italians, English, Trish 
Americans, Japanese, Chinese, and Turks 
Subjects chose the traits from a list of 8 


traits which they were given. They were askel 
to 


Read through the list of words . . . and select thos 
Which seem to you to be typical of the particula 
ethnic group. Write as many of these in the followin! 
spaces as you think are necessary to character} 
these people adequately, If you do not find prope 
words . . . for all the typical ethnic group character 
istics, you may add those which you think necessar 


for an adequate description [Katz & Braly, 1933, F 
282]. 


After this procedure had been followed for al 
10 ethnic groups, subjects were asked to £ 
back and star the five words on each lis 
that seemed the most typical of each group 


Only the starred items were used in the dat 
analysis. 


3'The work of Rice (1926) and Litterer (1933) 
While antedating the Katz and Braly study, Was con 
cerned with occupational stereotypes, rather tha 
Stereotypes of ethnic groups, The term “ethnic E. 
used herein to denote groups that are of differi” 
national origin (eg. Turks, Germans), a group te 
together by a common religion and cultural herita* 


(Jews), and a group defined by racial characteristic 
(Negroes). 
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Nes showed a very high degree of 
nic grou on the assignment of traits to eth- 
iM F or example, Negroes were char- 
Nei as superstitious" by 8446 of the 
seen as E as "lazy" By 7590; Jews were 
viewed E rewd” by 79%; and Germans were 
Wiedsure S aee E by 78%. A 
was Aive definiteness" of the stereotypes 
ber of a. at by calculating the least num- 
i Enn that would have to be included 
Cast by th or at least 50% of the 500 “votes” 
hen o e 1 subjects for each ethnic group. 
Choose. a ist of _84 traits from which to 
ect ae e theoretical range is from 2.5 (per- 
"v by everyone) to 42.0 (perfect 
4.6 trait ent); the obtained range was from 
| Thus me (Negroes) to 15.9 traits (Turks). 
Consi md subjects’ categorizations showed 
I ule able agreement with each other. 
bs and Braly concluded that definiteness 
or tales bore no relationship to liking 
since t} ethnic group or prejudice toward it, 
" Mr nos disliked groups—Negroes and 
Stereot were the most and least definitely 
D rd ar aa This tendency toward a 
Btoups ve in the categorization of ethnic 
| fallaci was seen by the authors as evidence of 
fractal ous thinking: “Stereotyped pictures of 
long and national groups can arise only so 
Eu individuals accept consciously or un- 
Bla, oy the group fallacy attitude toward 
of birth and skin color [Katz & Braly, 


9 
33, pp. 288-289].” 


jl 
| ouem APPROACHES TO STEREOTYPING 
] ne of “Stereotype” in Psychology 
| Bout rect generalizations. Generalizations 
nic members of a category (in this case; an 
in two group) can be construed as incorrect 
Tor „general senses. The first possibility for 
Beneraly m directionality, for instance, the 
BE m that most members of Group X 
ets of P Y when, in fact, most iiem- 
S c roup X do not possess Attribute Y. 
agnit ond possibility for error is 1M the 
t ude of the generalization, for instance: 
E p ization that “almost all” (say, 
Attribute the members of Group X possess 
members Y, while in reality only “most” 
tribute (say 65%) of Group X possess At- 
- It would seem that this latter type 


17 


of error is what is usually referred to by the 
term “overgeneralization.” 

Katz and Braly (1935) proposed a view of 
stereotypes that has persisted, in some quar- 
ters, to the present day. They stated: “A 
stereotype is a fixed impression, which con- 
forms very little to the facts it tends to rep- 
resent, and results from our defining first and 
observing second [p. 181]." This position is 
shared by Klineberg (1951) and Prothro and 
Melikian (1954). 

A larger group of authors is content to 
equate stereotype with generalizations, which, 
although overgeneralizations, may still de- 
scribe situations that are extant, although to 
a lesser extent than that expressed in a 
stereotype. A number of authors propose such 
a view—among them Schoenfeld (1942), 
Zawadzki (1948), Bogardus (1950), Allport 
(1954), Wedge (1966), Krech, Crutchfield, 
and Ballachey (1962), Gilbert (1951), 
Campbell (1967); and Rokeach (1968). 

There is one major difficulty with defini- 
tions such as those outlined above. Character- 
izing something as invalid necessarily implies 
that a validity criterion is available. Yet, 
with reference to the actual distribution of 
traits within ethnic groups, empirical evidence 
is exceedingly scanty. Thus, in most cases, no 
criteria are available for assessing the factual 
validity of an ethnic generalization. Cer- 
tainly, if an ethnic generalization is made in 
exceptionless terms, knowledge of the range of 
individual differences alone would seem suf- 
ficient grounds for terming such a generaliza- 
tion invalid. If, on the other hand, the gen- 
eralization is phrased in terms of “most” or 
“some,” considering the present lack of rele- 
vant data, the validity of this generalization 


cannot be assessed. : s 
Generalization of unspect. fied validity. Many 
f the validity 


researchers treat the issue o 
(Abate & 


of generalizations as irrelevant 
Berrien, 1967; Blake & Dennis, 1943; Brown, 


1958, 1965; Child & Doob, 1943; Diab, 1962, 
Edwards, 1940; Sherif & Sherif, 
i & Upadhyay, 1960a). In this 
iew, any generalization is a stereotype. 
"hers (Fishman, 1956; Mann, 1967a, 
d, 1956; Schuman, 1966; Vinacke, 
1949, 1956) do not consider validity irrele- 
vant, but recognize that classing stereotypes 
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as overgeneralizations implies the existence of 
a criterion of validity. However, as already 
indicated, such a criterion has seldom been 
available. 

Categorizations and concepts. Some writers 
speak of stereotypes as the use (or misuse) of 
Categories. This orientation is expressed by 
Secord (1959): «A stereotype is commonly 
thought of as involving a categorical response; 
i.e., membership in a category is sufficient to 
evoke the judgment that the stimulus person 
Possess all [italics added] the attributes be- 
longing to that category [p. 309].” Similar 
orientations are expressed by Harding, Kut- 
ner, Proshansky, and Chein (1954), Sanford 
(1956), Saenger and Flowerman (1954), and 
Simpson and Yinger (1965). 

When stereotypes are treated in this man- 
ner, the category becomes isomorphic with 
the observer's concept of the group. Vinacke 
(1957) has proposed that "stereotypes should 
properly be regarded as concept-systems, with 
positive as well as negative functions, having 
the same general kinds of properties as other 
concepts, and serving to organize experience 
as do other concepts [p. 229].” Concepts are 
Seen as “cognitive organizing systems" which, 
in conjunction with attitudes, operate in the 
control of response, 

Concepts have bo 


Would agree; 
those features that 
unique experience wi 
the Concept personali 
differ from the con 
Stereotyping is a co 
Proposed, whose cru 
ality traits are iny 
well as Physical ch 


Pre- 
incorporated 
r ic manner, so 
1S, persons whose at- 


in a probabilist 
that 
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tributes do not match all of those included i 
the concept, can be recognized. 

Products of a “jaulty” thought process. 1i 
a stereotype is looked upon as a generaliza- 
tion that may or may not be valid, then the 
question arises as to what distinguishes a 
stereotype from an “ordinary” generalization. 
To many psychologists, the crucial difference 
is the way in which the stereotype is ar- 
rived at. This viewpoint expands upon Lipp- 
mann’s (1922) point that the stereotyping 
process is a type of short-circuiting, which 
“precedes the use of reason [p. 98].” d 

There appear to be two variants of this 
“faulty process” view. In the first sense, the 
process itself is seen as inferior, for instance, 
the person is using nonlogical reasoning. The 
alternate view is that the person’s basis of 
acquisition falls below acceptable standards, 
that is, he develops his beliefs from sources 
that should not be employed, such as hear- 
say. Klineberg (1951) stated this latter posi- 
tion as follows: *Unlike other generalizations 
: +. Stereotypes are based not on an inductive 
collection of data, but on hearsay, rumor, 
and anecdotes—in short, on evidence which is 
insufficient to justify the generalization [p. 
505].” 

The view that stereotypes are the result of 
an inferior process itself has been supported 
by Sanford (1956), Zawadzki (1948), and 
Bogardus (1950). Fishman (1956) presented 
this outlook in straightforward form: 


one man’s objectively and analytically determined 
facts can be another man’s stereotypes; in fact, ong 
man’s stereotypes may, subsequently, become the 
same man’s analytically derived facts. Thus it is not 
the data which necessarily undergoes a change, but 
the process of arriving at the data [p. 34]. 


It should be mentioned that writers who 
have taken the position that stereotypes are 
Invalid Overgeneralizations also may be in 
Congruence with the “inferior process" posi- 
tion. Presumably, if a person has himself 
developed an invalid overgeneralization, this 
Will have been accomplished either through 
some sort of faulty thought process or from 
an unacceptable basis of acquisition. (It is; 
of course, also possible that his generaliza- 
tion has been learned from a respected 
Source.) It does mor necessarily follow, how- 
ever, that the results of a faulty thought 


Ls 
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Process must themselves be invalid (Fishman, 
1956; Triandis & Vassiliou, 1967). It should 
be mentioned that although data on the 
process of acquisition should be obtainable, 
little work has been directed toward this 
end. Thus, as is the case with definitions 
stressing factual incorrectness, theorists are 
generally unable to present data that support 
their contentions. 

Generalizations characterized by rigidity. 
Still another criterion that has been used 
for the identification of stereotypes is their 
rigidity. To Kerr (1943), the hallmark of 
Stereotypes is that they “do not develop but 
remain ossifed [p. 37]." Meenes (1943), 
Bogardus (1950), Adorno, Frenkel-Brunswik, 

evinson, and Sanford (1950), Sanford 
(1956), Simpson and Yinger (1965), Katz 
and Braly (1935), Katz and Stotland (1959), 
Sinha and Upadhyay (1960a), Rokeach 
(1960), and Scott (1965) have also stressed 
rigidity, 
_ Fishman (1956) brought up a point that 
'S relevant to those descriptions of stereo- 
lypes that stress their rigidity. He pointed 
Out that a stereotype can be seen as rigid only 
if it is not modified in the face of (a) new 
information; (5) changes in a person's needs, 
Motives, or interests; or (c) a changed focus 
9f interaction. If there is not change in any 
a these areas, then there is no rigidity im- 
Dlied when a stereotype remains unchanged. 

hus, “rigidity may be an aspect of some 
Stereotypes (and of some attitudes in general) 

Ut it must not be of all [Fishman, 1956, 
p. 39].» 

Stereotypes as habits. Campbell (1967) has 
attempted to analyze stereotyping from within 

€ framework of Hullian theory. Essentially, 
Campbell is modifying a formula used to 
€xplain behavior into one to explain beliefs. 
lereotypes, under this rubric, are regarded 
b “perceptual &Ej;'s [Campbell, 1967, p. 
21].” Stimulus intensity, V, can be regarded 

the contrast between groups. Since a 
Rue value of V will lead to a greater sEn, 

e lows that the greater the real differences 
deca groups on any particular custom, 
Mater; of physical appearance, Or item of 

al culture, the more likely it is that 
feature will appear in the stereotyped 


im 
ery each group has of the other. 


The “intrapsychic” D, K, and H determi- 
nants are regarded as “projections” of the 
personality of the perceiver. The more remote 
and less well-known the outgroup, the more 
purely projective the content of the stereo- 
type will be. Also, among trait differences of 
approximately equal contrast (equal V), those 
which are drive (D) and incentive (K) rele- 
vant and which are familiar (H) are most 
apt to be noticed and represented in stereo- 
typed imagery. The implications of this ap- 
proach are that most stereotypes will be some- 
what valid (due to V). It is those cases in 
which the outgroup is relatively unknown 
where the stereotype may be almost com- 
pletely invalid. 


Why Are Stereotypes Undesirable? 


The prevailing orientation of social scien- 
tists toward the use of stereotypes has been 
summed up by Brown (1958): “They think 
it is at least irrational and probably wicked 
to subscribe to them . . . [p. 364]." As re- 
viewed above, most theorists have regarded 
stereotypes as “wrong,” either because they 
do not correspond to the facts about the 
ethnic group, or because they were arrived 
at through some unacceptable process. Yet, as 
already mentioned, criteria for assessing the 
degree to which stereotypes have met these 
standards (content and/or process) are 
largely unavailable. Thus, such formulations 
must remain at the level of assumptions. The 
widespread adoption of such assumptions has 
led Brown (1958) to ask, “Ts it possible that 
the social psychologist has used the. word 
stercotype to stigmatize beliefs of which he 
disapproves but which he does not know to 
be false? Has he perverted his science to 
achieve a moral purpose [D. 366]?" 

Brown and Campbell have attempted to go 
beyond the views discussed thus far, to iso- 
late those characteristics of stereotypes that 
irable. Brown (1958) proposed that 
pectability" of the popu- 

pes has been “convincingly” im- 
I ned. Stereotypes, he stated, are often not 
well-founded in direct experience; Ee som 
times serve to rationalize selfish be avita 
they are often not sensitive to acus he 
dence; and they ascribe to racial inheri a E 
that which may be a cultural acquisition. 


are undes 
the "intellectual res 
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a later work, Brown (1965) was content to 
summarize the objectionable aspect of stereo- 
types as their ethnocentrism and emphasis on 
cultural absolutism, the implication they 
convey that ethnic groups have inborn and 
unalterable psychological characteristics. 
Campbell (1967) listed four general charac- 
teristics that he felt were wrong with stereo- 
types. The first point is the “phenomenologi- 
cal absolutism” of the ingroup member's 
imagery of the outgrouper or minority group 
member. Naively, one assumes without ques- 
tion that the outgroup is as one perceives it; 
or as the ingroup informs one about it. The 


second deficiency is in the amount of between- 
group difference 
homogeneity 


acial, rather 
point is very 
cussed above. 


per believes that it is 
tics of the outgroup 
y, rather than recog- 
eexisting hostility that 


825]." e [Campbell, 1967, p. 
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types held by.members of one national group 
about the members of selected other national 
groups. Most researchers have used the Katz 
and Braly experimental framework in toto, 
often using the identical 84 adjectives (trans- 
lated when necessary). Simple recitation of 
the results of such research can serve little 
purpose here. The reader is directed to the 
articles themselves (see, e.g., Bastide & van 
Den Berghe, 1957; Chandra, 1967; Das, 
1962; De Bie, 1951; Jahoda, 1959; van Den 
Berghe, 1962; Zaidi & Ahmed, 1958) for 
specific results. 

Stereotypes of Negroes as assessed by the 
Katz and Braly paradigm have tended to be 
rather definite and quite negative. Such traits 
as superstitious, lazy, dirty, and ignorant usu- 
ally have headed the list in frequency of 
attribution. Recent psychological research and 
public opinion polling results indicate that 
although the United States white's view of 
Negroes may be becoming somewhat less 
negative, it is still radically different from 
his view of white Americans. It should be 
added that this view of Negroes is not lim- 
ited to the United States. Subjects in India, 
Lebanon, Great Britain, Pakistan, and other 
countries hold similarly negative stereotypes 
of Negroes (e.g., Prothro & Melikian, 1954). 
There is evidence that many Negroes accepted 
this view also. 

Bayton (1941) tested Negro college stu- 
dents at Virginia State College. Using the 
Katz and Braly technique, subjects were told 
to check the most typical traits for 10 groups; 
including “the Negro," Among the traits most 
commonly assigned to “the Negro” were 
musical, superstitious, very religious, lazy, 
loud, and happy-go-lucky. When asked to 
describe the "typical Virginia State College 
student,” however, the subjects checked talka- 
tive, intelligent, lucky, sportsmanlike, jovial, 
Sophisticated, and other more favorable traits. 
, Apparently, the description “the Nest 
implied “the lower-class Negro" or “the aver- 
age Negro, who is of lower-class” to these 
subjects (and perhaps to many white sub- 


jects in other studies also), Bayton com- 
mented: j 


Here isa group accepting what has been called the 
coe fallacy.” Also, they accept, without contra- 
diction, the stereotypes assigned other minorities 25 
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shown in their characterization of Jews. Moreover, 
they lar ccept the stereotype usually as 
themselves . . . [p. 102]. 

Jewish high school students also have tended 
to accept the prevailing stereotype of Jews 
(Dewey & Humber, 1966, p. 363). 

The importance of offering additional items 
of descriptive information is clearly demon- 
Strated in a study carried out by Bayton, 
McAlister, and Hamer (1956). These investi- 
gators differentiated the object groups by race 
(Negro-white), but also by social class 
(upper-lower). It was found that the resulting 
characterizations varied more as a function of 
class than of race. Thus upper-class whites 
and Negroes were seen as intelligent, am- 
bitious, industrious, neat, and progressive, 
While lower-class whites and Negroes were 
assigned the traits, ignorant, lazy, loud, and 
physically dirty, in common. Lest these find- 
ings be taken as evidence of the absence 
of racial stereotypes, however, the authors 
caution: 


It is altogether possible that when the subjects are 
asked to stereotype "white Americans" and "the 
Negro" they assume that most white Americans tend 
toward the upper-class positions while most Negroes 
tend toward the lower-class status [p. 841. 


Development oj Ethnic Stereotypes and Their 
esistance to Change 
_Although there has been little research 
directed toward analyses of the processes 
through which stereotypes arise, that which 
as been carried out has focused on Negroes 
as the object group. Skin color and racial 
Membership appear to be perceived as possess- 
ing both racial and social significance by 
Children by the time they reach age 4 or 5 
(Clark & Clark, 1939; Goodman, 1952; 
Morland, 1962). Numerous studies have indi- 
Cated that white skin color is overvalued and 
dark skin color undervalued by 
and Negro children (e.g. see Asher & Allen, 
1969; Goodman, 1952; Greenwald & Oppen- 
heim, 1968). But few researchers have in- 
Vestigated whether this early racial aware- 
ness and valuation is accompanied by the 


both white 


4 
one d and Perlmutter (1957) have also dem- 
formated the importance GI attditiqnal items of in- 
lon iit studies of “stereotyping. 
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differential attribution of stereotype-relevant 
traits to whites and Negroes. 

Taylor (1966) showed nine pictures ofa 
Negro and a white child engaged in some 
activity to children aged 6 to 10. For each 
picture, they were asked to identify which 
person had a given negative stereotype- 
relevant trait. He found that white children 
assigned the negative characteristics signifi- 
cantly more often to the Negro child in the 
photograph than to the white. In addition, 
Negro children assigned two of the traits 
(dirty and dumb) significantly more often 
to the Negro child than to the white child. 

An interesting analysis of the development 
of stereotypes in school children was carried 
out by Blake and Dennis (1943). White 
schoolchildren in Charlottesville, Virginia, 
were to check whether each of 60 traits was 
more common in the Negro, in the white, or 
“no difference.” Subjects in Grades 4 through 
11 were tested. 

The authors calculated a Preponderance 
Index, which was simply the percentage of 
subjects who assigned a trait to Negroes 
minus the percentage assigning it to whites. 
The traits most overwhelmingly attributed to 
Negroes were superstitious, move slowly, eas- 
ily frightened, better servant, flashy clothes, 
ignorant, sleepy-natured, and happy-go-lucky 
(all Preponderance Indexes above 80). Inter- 
estingly, the Preponderance Indexes for these 
and other traits increased dramatically for the 
older children (Grades 8 through 11). It ap- 
d that the young white child first ac- 
quired a generally unfavorable attitude toward 
the Negro. It was only as he grew older that 
he gradually learned to apply the "adult 
stereotype." Thus, the older whites applied 
traits to the Negro that were a part of the 
stereotype, but which were not im and of 
themselves uncomplimentary, such as cheer- 
ful, fond of music, religious, sense of humor, 

d less selfish. The younger 


sense of rhythm, an : phe 
children made no such differentiations and 
applied negative traits more indiscriminately 
S D 


Tego. 
j [a Sm longitudinal studies of the 
degree of stability or change of individual 
stereotypes Over time have not been carried 

t. Several studies have indicated, however, 
pu ] group -stereotypes-ean-snow 


n moda apti 


peare 
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considerable change in the íace of changing 
external situations. Thus, Dudycha (1942) 
and Meenes (1943) found that traits assigned 
to Germans and Japanese by Americans were 
more negative during World War II than 
they had been in pre-World-War-II studies. 
Sinha and Upadhyay (1960b) found that 
students in India assigned significantly more 
negative traits to Chinese a few months after 
the 1959 Chinese-Indian border dispute than 
they had a few months before the dispute. 

A series of studies has indicated that the 
traits most commonly attributed to ethnic 
groups show considerable stability over time. 
Centers (1951), for example, took the 12 
most frequently assigned traits for each ethnic 
group as found by Katz and Braly in 1932, 
and in 1950 asked subjects to match the 
adjective lists with the ethnic group they 
Tepresented. For all 10 ethnic groups, over 
70% of the subjects made the correct match, 
Hoult (1953) was unable to replicate these 
results, however, 

Using the Katz and Braly technique, 
Gilbert (1951) compared group stereotypes 
obtained from Princeton students in 1950 
with those obtained by Katz and Braly in 
1932. Although the ordering of characteristics 
varied somewhat, the traits most commonly 
assigned to the various groups in 1932 were, 
for the most part, assigned again in 1950. 
The students in 1950, however, showed less 

à mong themselves about assigned 
traits, that is, the overall group stereotypes 


were less “definite.” Gilbert interpreted this 
finding as ‘impressive eviden 


to the stereotyping tendency, 


of such formulations [Gilbert, 1951, p. 246]." 


y Katz and Braly 
bly greater than 
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that found by Gilbert (1951). Nevertheless, 
Karlins et al. posited that their findings indi- 
cate that “younger generations show more 
careful thinking about ethnic generalization 
than their counterparts of the 1930’s [p. 14].” 


Self-Stercotypes and Reciprocity of Ethnic 
Stereotypes 


While a group of subjects (themselves an 
ethnic group) may have stereotypes about 
another ethnic group, it follows that members 
of the second ethnic group will also have 
stereotypes about the first. This two-way 
reciprocal feature of ethnic stereotyping has 
been investigated by Vinacke (1949, 1956), 
Buchanan (1951), Riffenburgh (1966), Schu- 
man (1966), Triandis (1967), Campbell 
(1967), and Abate and Berrien (1967). Re- 
searchers working with multiple ethnic groups 
have found a significant amount of agree- 
ment between different ethnic groups! views 
of any specific rated group. This consistency 
has been found in Hawaii (Vinacke, 1949), 
Europe (Buchanan, 1951), and Pakistan 
(Schuman, 1966). 

Another possible product of cross-cultural 
research is the comparison of Group A's 
stereotype of Group B with Group B's char- | 
acterization of themselves (self-stereotype)- 
Campbell (1967) and LeVine (1965) have 
examined stereotype data from 50 respondents 
in each of 30 tribal or linguistic regions m 
East Africa, each region giving "attitudes 
toward 13 outgroups. They found that in 
many cases there was considerable agreement 
between neighboring groups and the target 
group. As might be expected, it has peer. 
found that traits regarded as typical of one's 
own group are usually “overvalued,” that 15; 
they are thought to be more favorable oF 
Complimentary by the groups than other 
ethnic stoups who do not consider the trait 
typical of themselves (Vinacke, 1956). 

Abate and Berrien (1967) have carried this 
Paradigm one Step further by examining 
Stereotypes, Self-stereotypes, and the esti- 
mated “real! characteristics of the group 
(which they term the “vereotype”), as mea- 
sured by the Edwards Personal Preference 
Scale (Epps ). Their subjects, Japanese and 
American students, were presented with 15 
behavioral orientations drawn from the EPPS 
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and asked to rate, on a 5-point scale, how 
well each description "fitted" Japanese or 
Americans in general. Mean self-report EPPS 
Scores constituted the *vereotype." 

Rank-order correlations demonstrated that 
each group was quite accurate in repro- 
ducing the other group's “self-stereotype”; 
but the correspondence between stereotypes 
and “vereotypes” was less impressive. The 
correlation between Japanese stereotypes of 
Americans and the American vereotype was 
.54, but the correlation between the American 
Stereotype of Japanese and the Japanese 
Vereotype was .20 for women and —.04 for 
men, Surprisingly, the respondents were no 
more accurate in perceiving the vereotype of 
their own national group than in perceiving 
the vereotype of the foreign group. American 
Seli-stereotypes agreed with their vereotypes 
to a much greater extent than was true for 
the Japanese subjects, however. 

The question arises as to whether the EPPS 
serves as a suitable criterion for the “true” 
characteristics (Lie, vereotype) of the groups 
in question. This situation is the obverse of 
One in which judgments of some sort (e.g. 
peer ratings) are used as the validation cri- 
teria and the instrument (e.g.; the EPPS) is 
Compared with these criteria. Certainly, if the 
Epps “vereotypes” are seen as an acceptable 
Validation criterion, then it would seem that 
the proposition that national stereotypes rep- 
resent the recognition of true group differ- 
ences (Campbell, 1967) does not hold in 
all cases, Of course, it should be noted 
that Campbell was speaking of interacting 
8toups—tribal or linguistic regions in East 

frica. Presumably, the two national groups 
Used by Abate and Berrien had had Jittle or 
10 opportunity to interact with one another. 


Relationships among Attributes of Stereotypes 


Edwards (1940) has proposed four 
"di i : hat might 
dimensions? of stereotypes tha g 

direction 


Profitably be analyzed: uniformity, 
(favorable or unfavorable), degree of favor- 
ableness, and quality (actual content). Uni- 
formity refers to the extent to which the indi- 
Vidual’s response is in accord with the re- 
Sponses of others (viz., earlier discussion of 
the “definiteness” of stereotypes). Under this 


rubric, then, most studies of stereotypes have 
dealt with the uniformity dimension. 

The evidence seems to indicate little or 
no relationship between uniformity of ethnic 
stereotypes and stated preference for the 
ethnic group. Taft (1959) has calculated rank- 
order correlations for some of the relevant 
published data, with the following results: 
The uniformity-preference correlation in the 
Katz and Braly (1933, 1935) studies was .07, 
in Schoenfeld’s (1942) study it was .19, 
while Gilbert's (1951) data produced a —.01 
correlation. Although their data are not 
quantifiable, Vinacke (1956) and Prothro 
and Melikian (1954, 1955) found a similar 
lack of relationship. Taít's own data, with 
familiarity partialed out, yield preference- 
uniformity correlations of .11 and .01. 

Although rated preference for the group 
seems to show little relationship to uniformity, 
favorability of the actual traits assigned does 
appear to be related to stereotype uniformity. 
Using ratings of the “degree of favorableness" 
of the trait adjectives, gathered after subjects 
had completed the trait-attribution task, 
Karlins et al. (1969) computed overall favor- 
ableness ratings for each ethnic group. Each 
trait's frequency was multiplied by its mean 
favorableness value, and the sum of the values 
was divided by the total frequency. They 
found that favorableness showed a .77 cor- 
relation with uniformity in their sample. 
Using the same favorableness ratings, Karlins 
et al. calculated favorableness-uniformity cor- 
ations of .16 and .68 for the data of Katz 
1933) and Gilbert (1951). Fur- 
f ethnic groups accord- 
ores derived from the 
above two studies correlated .75 and .52 with 
those derived by Karlins et al. (This finding 
is valid only to the extent that the traits 
have remained constant in their perceived 
favorableness over this period, of course.) 
These findings, contrasted with those of Taft 
(1959) discussed above, suggest that mean 
favorableness of stereotype about an ethnic 

a good indicator of preference for 
7] hen these are averaged 


rel 
and Braly ( 
ther, rank orderings 0 
ing to favorability sc 


group is not 
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over all subjects. 

One attempt has been made to relate the 
extent to which individual subjects agree with 
th atterns of trait attributions, to 
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attitudes toward the ethnic group. Working 
on the assumption that responses from 200 
college students will yield a rough representa- 
tion of the (college) subsample’s modal 
stereotypes assessed under the Katz and Braly 
paradigm, Brigham (1969) calculated a score 
denoting the degree to which each subject’s 
five traits assigned to Negroes agreed with 
the most common assignments made by the 
group as a whole. This score was then cor- 
related with score on a much-validated mea- 
sure of attitudes toward Negroes. No rela- 
tionship was found (r = 03), although sub- 
jects’ agreement with modal trait assignment 
to (presumably white) Americans was slightly 
correlated (r = .14, p < .05) with anti-Negro 
attitude. 

Familiarity with the ethnic group does 
appear to show a nonambiguous relation- 
ship to uniformity of stereotype, however, 
In Taft’s reanalysis, Katz and Braly (1933), 
Schoenfeld (1942), and Gilbert (1951) ob- 
tained familiarity-uniformity correlations of 
51, .20, and -66, respectively. (It should be 
noted here that in Schoenfeld’s own analysis 
of his data, he did not feel that the 
familiarity-uniformity relationship was sig- 
nificant.) For Taft’s data, the familiarity- 
uniformity correlation was .85. Triandis and 
Vassiliou (1967), on the other hand, found 
that while American stereotypes of Greeks 
became more uniform with contact, the rela- 
tionship did not hold for the Greek stereotype 
of Americans, 

The familiarity-preference interaction was 
also investigated by Taft. While he found a 
very high positive relationship (+ = 92), pre- 
Vious studies did not yield such striking re- 
sults. As Scott (1965), Triandis and Vassiliou 
(1967), and others have pointed out, the 
Re sole, relationship would seem to 
Poitier e circumstances under which 

: y 5 acquired (e, 
» and the kin 
that. the groups maintain 

Vinacke (1957) has 


8. friendly coopera- 
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Suggested that findings 
to the ingroup (self- 
ao) be Considered as another facet of 

:cotyping. As might he expected, the traits 


(Campbell 1967, Child 


Karlins et al., 1969; Schuman, 1966; Sherif 
& Sherif, 1969; Vinacke, 1949, 1956). P 
thermore, as mentioned earlier, traits regarde 
as typical of one's own group are usually 
“overvalued.” ; 

To summarize the findings in this area, it 
appears that when data are summed over all 
subjects, the uniformity of stereotypes is 
related to favorableness of the most commonly 
included traits, but not to preference for the 
ethnic group. Individuals? agreement with the 
most common patterns of trait attributions 
does not appear to be related to attitude 
toward the ethnic group, at least when Negroes 
are the object group in question. Uniformity 
of stereotype seems to be positively peated 
to familiarity with the object group, and 
traits attributed to one’s own group tend to 
be more favorable than traits applied to 
other groups. The relationship between Fas 
miliarity (or degree of contact) and prefer- 
ence would seem to be dependent upon the 


circumstances under which the familiarity was 
achieved. 


CRITICAL ISSUES IN STEREOTYPE RESEARCH 
Methodology 


There are several recurring methodological 
weaknesses that have characterized much of 
the stereotype research reviewed thus far. 
Probably the most obvious problems are m 
herent in the Katz and Braly technique, which 
has been so widely utilized. This method, it 
will be remembered, directs the subject tO 
pick out (from a list he is given) those 
adjectives that he considers to be the “most 
typical” of each ethnic group. This procedure, 
then, may force the subject into thinking iM 
terms of generalizations and categories. f 

Eysenck and Crown (1948), in a study 0 
national stereotypes employing the Katz ani 
Braly technique, found that 19% of ther 
Subjects refused to take part in the project 
declaring that it was meaningless or ire 
sible. Of the remaining subjects, over half E 
them recognized that they were being pe 
to use Stereotyped answers they had hear 
elsewhere, Eysenck and Crown concluded: 
our results suggest 


respondents hav, 
at all... 


give the s 


Per contra that the majority E 
€ no particular views on fne subs 

When we look for stereotyped views, ae 
ubject no chance to reply in any nom 
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stereotyped fashion, we should not be surprised 
that the answers we get are stereotyped ones. It is 
strictly inadmissible to conclude from such results, 
however, that the views actually held by the subjects 
are stereotypes . . . [p. 


Brown (1965) reminds us that the 


investigators of ethnic stereotypes did not give their 
subjects the opportunity to be sensible men. They 
forced decisions on the basis of one item of informa- 
tion—ethnic membership. May not their subjects 
have been making the best of such a task. [p. 179]? 


Ehrlich and Rinehart (1965) compared re- 
sponses given to a Katz and Braly type of 
checklist to those given in an open-ended 
format. In the open-ended format, subjects 
listed about 5 traits per ethnic group; while 
they checked from 8 to 19 traits per group 
under the checklist condition. Differences be- 
tween mean number of traits listed for every 
the two conditions were 
Significant. Furthermore, although subjects in 
the checklist condition were told that they 
could add traits to the list, only 14 traits were 
actually added (0.2% of the total trait assign- 
ments made). 

Even the number and kind of ethnic groups 
included on a list seem to have a significant 
effect on the picture that emerges of any 
Specific ethnic group. Diab (1963a) found 
that modal trait attributions made about an 
Unpopular ethnic group (Russians) became 
more unfavorable when that group's name 
Was placed next to the name of à favored 
ethnic group (Americans). Further research 


(Diab, 1963b) has indicated that when the 
as most favorable by the 


ethnic group under 


ethnic groups seen s 
subjects are omitted from the study, overa 
trait attributions made to groups that 


had previously been viewed with ambiguity 


become more favorable. 


The Kernel of Truth Controversy 


Can a stereotype continue to exist In à 
culture without any “kernel of truth? what- 
Soever inherent in it? Debate on this issue 
Continues among those researchers concerned 
with ethnic stereotypes. The study most often 
Cited by those who propose that stereotypes 
can exist without any kernel of truth was that 
E LaPiere (1936). In the 1920s in southern 

alifornia, Armenian laborers were seen as 
(stereotyped as) dishonest, deceitful, liars, 
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and troublemakers. In fact, LaPiere found 
that Armenians appeared less often in legal 
cases, applied less often for charity, and had 
as good credit ratings as those received by 
the other ethnic groups. While it would thus 
appear that the stereotype maintained itself 
without any supportive evidence at all, the 
necessarily superficial nature of this study 
seems to leave the issue somewhat in doubt. 

Schoenfeld (1942) attempted to address the 
issue more directly with a study of “first 
name stereotypes.” After a series of pretests, 
Schoenfeld selected eight masculine and eight 
feminine names for which there seemed to be 
wide agreement on the personality character- 
istics “belonging” to each name. In the final 
phase, subjects were told to match each name 
with the appropriate personality character- 
istics. For both male and female subjects, 
there was considerable agreement, well above 
chance levels. Interestingly, only 7 of 248 
original subjects complained that the task 
was difficult or meaningless. Klineberg (1954) 
felt that this study “has demonstrated that 
stereotypes can and do develop without any 
basis in objective reality [p. 486].” 

Wells, Goi, and Seader (1958) studied col- 
lege students’ characterizations of persons 
who buy different makes of new cars. In 
1956, people who bought Plymouths were 
seen as “quiet, careful, slow, silent, moral, 
fat, gentle, calm, sad, thinking, patient, 
honest, content, and understanding." In 1957, 
however, Plymouth initiated a powerful adver- 
and students in 1957 (about 
70% of whom had also been included in the 
1956 sample) then described owners as "high 
class, important, rich, particular, different, 
and feminine [Wells et al, 1958, p. 121]: 
Thus, the image of Plymouth owners under- 
went a dramatic change in one short year. 

That facts can affect stereotypes has been 
shown by those studies that focused on 
change due to external events, for instance, 
war. The similarities between stereotypes of 
one ethnic group held by various other ethnic 
groups have led Vinacke (1956) to postulate: 
diculous to assert that groups of dif- 
ul ed iet don 

in generi acteristics which differentiate e 
certain E yg Sfint origin. . -+ D pus Mes 
superficial, and limited, they [national stereo ypes] 
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nevertheless generalize some actual cultural traits 
[p. 285]. 


In a similar vein, Margaret Mead (1956) 
proposed that “the elements which make up 
Such stereotypes are accurate but incomplete 
descriptions of national character in particular 
cross-cultural contexts [p. 222].” 

The findings of agreement between stereo- 
types of a group and that group's self-stereo- 
type would seem to lend support to the posi- 
tion that ethnic stereotypes can have validity 
(Campbell, 1967; Fischer & Trier, 1962; 
LeVine, 1965; Schuman, 1966; Vinacke, 
1949, 1956). One possible precondition to 
such validity is suggested by Triandis and 
Vassiliou (1967) who, interpreting their find- 
ing with regard to Greek and American sub- 
jects, observed: "The present data suggest 
that there is a *kernel of truth in most stereo- 


types when they are elicited from people who 


have firsthand knowledge of the group being 


stereotyped [p. 324]." 

On the other hand, Abate and Berrien 
(1967) have indicated that although there 
Seems to be considerable agreement between 
stereotypes and self-stereotypes, there may be 
very little correspondence between stereo- 
types and modal group characteristics (*vereo- 
type"), at least as 
inventory (th 
the problem 
interested in 
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In sum, there is ample evidence that ethnic 
stereotypes can have “kernels” of truth, at 
least in a convergent validity sense, that is, 
agreement between several groups as to the 
traits that characterize a particular object 
group. However, the absence of acceptable 
validity criteria leaves the question of 
whether all such stereotypes have some valid- 
ity unanswered. 


Relationship of Ethnic Stercotypes to Ethnic 
Prejudice and Discriminatory Behavior 


Ethnic stereotypes and prejudice. Probably 
the most basic characteristic linking stereo- 
types and ethnic prejudice is an awareness of 
discrete groups. Obviously, in order to feel 
negatively toward a group, one must be able 
to perceive the different individuals of the 
given ethnic group as having certain constant 
characteristics, as being similar to other indi- 
viduals in the same group, and as being dif- 
ferent from individuals not of that ethnic 
group. In other words, in order to be preju- 
diced against Group A (and, in the same 
sense, to hold stereotypes about Group A), 
one must have a concept (category) of the 
group. | 

Several studies (Allport & Kramer, 1946; 
Elliott & Wittenberg, 1955; Himmelfarb, 
1960; Lindzey & Rogolsky, 1950; Scodel & 
Austrin, 1957) have seemed to demonstrate 
that prejudiced persons are more accurate in 
their categorization of object-group members 
than are nonprejudiced subjects. These studies 
appeared to show that anti-Jewish subjects 
could more accurately identify photographs 
of Jews than could nonprejudiced subjects: 
However, with one exception (Himmelfarb; 
1960), all the studies have found that anti- 
Semitic Subjects judge more faces as being 
Jewish than do nonprejudiced subjects. Tt 
appears, as Secord and Backman (1964) DEA 
Posed, that the anti-Semite errs on the “safe 
side, putting a picture in the Jewish category 
if there is any likelihood that the person might 
belong there, It now appears that the en- 
hanced accuracy earlier attributed to presti 
diced persons was an artifact rising from thei! 
tendency to Judge a greater number of face? 
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to be Jewish, independent of the actual 
stimulus attributes.” 

The undifferentiated nature of concepts 
(categories) used by prejudiced persons has 
been investigated by Secord and his co- 
Workers. Secord, Bevan, and Katz (1956) 
selected 10 pictures of Negroes ranging along 
a rough continuum of “Negroidness” as 
judged by physiognomic variables. It was 
hypothesized that if the “overcategorization” 
factor of stereotyping is indeed operative, then 
there should be no difference in the personal- 
ity characteristics assigned by the prejudiced 
Subjects to the pictures varying in Negroid- 
ness, as long as they are perceived as 
belonging to the category (concept) “Negro.” 
This was the case, as the judges were very 
Similar in their attribution of traits to all the 
Negro pictures. The authors concluded that 
in the eyes of the high school subjects (in 
Atlanta, Georgia), “a ‘Nigger’ is a ‘Nigger’ no 
Matter how white he may be [p. 60].” 

Secord (1959) used a similar procedure. 
Subjects were given a list of 20 traits from 
Which to choose; 4 had previously been iden- 
tified as unfavorable aspects of the Negro 
Stereotype, 4 related to favorable aspects of 
this stereotype, and 6 traits referred to favor- 
able and 6 to unfavorable traits irrelevant to 
the cultural stereotype of the Negro. Again 
bictures of Negroes, varying in physiognomic 
Negroidness, were used. Subjects received a 
Dicture-stereotyping score, à verbal-stereo- 
typing score, and a questionnaire-prejudice 
Score, The subjects stereotyped all of the 
Pictures to an equal degree, regardless of the 
amount of Negroidness, while the more preju- 
diced subjects stereotyped all of the pictures 
to a greater extent. Verbal stereotyping Was 
related to prejudice to a greater extent 
(r—.60) than was picture stereotyping 
(r = 36). l 
. The most widespread view among writers 
in this area is that stereotypes perform a 
Junction for the prejudiced person: allowing 
him to rationalize the hostility that he feels 
toward the group. Simpson and Yinger 


a Perloc (1964) found that there was & tendency 

2 Subjects with strong favorable attitudes inna 

grades groups (prep school oF high. re 

NE be somewhat more accurate in the 
izations from pictures. 


(1965), Gordon Allport (1954), Saenger 
(1953), LaViolette and Silvert (1951), and 
Ehrlich | (1962a, 1962b) adopt such an 
orientation. 

Other researchers regard stereotypes as an 
actual component, the "cognitive" aspect, of 
prejudicial attitudes (Harding et al, 1954; 
Harding, Proshansky, Kutner, & Chein, 1969; 
Katz & Stotland, 1959; Secord & Backman, 
1964). Finally, Vinacke (1949) attempted a 
sort of synthesis of both of these orientations, 
stressing the "overlap" between stereotypes 
and prejudice. He proposed that stereotypes 
may serve as verbal expression of prejudice, 
or the rationalization or projection of it, or 
they may express à prejudice partly, or the 
stereotype may not be an expression of preju- 
dice at all. Similarly, a prejudice may be ap- 
parent in a stereotype, may help determine 
it, may make use of the convenient labels, 
or may express itself differently, or be 
present without stereotyping (Vinacke, 1949, 
p. 288). 

Longitudinal research into the development 
of negative attitudes and stereotypes, focusing 
on early childhood, might serve to shed light 
onto the actual relationship between stereo- 
types and prejudice. At present, empirical evi- 
dence on this question is relatively meager. 
Correlational analyses have indicated that 
ethnic prejudice and stereotyping seem to 
covary (Koenig & King, 1962, 1964; Saenger 
& Flowerman, 1954; Secord, 1959). Further, 
the ethnic categories of prejudiced persons 
appear to be less well-differentiated (Secord 
et al, 1956). While it seems to have been 
demonstrated that prejudice does not exist in 
the absence of stereotyping, little empirical 
attention has been directed toward investi- 
gating whether stereotyping can exist in the 


absence of prejudice." 
The widespread consen: 
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analysis has been proposed and endorsed 
without grounding in empirical testing. Al- 
though the operationalization of a test of 
“rationalization” might be difficult to set up, 
it would seem that research in this realm 
would be eminently worth while. 

Under the Katz and Braly paradigm, sub- 
jects have been willing (at least in the earlier 
studies) to stereotype virtually all ethnic 
groups. Unless some type of pervasive out- 
group prejudice is postulated, it would ap- 
pear that this indicates the presence of stereo- 
typing without prejudice. It follows that these 
stereotypes are not functioning as rational- 
izers. The question then becomes, How and 
why do stereotypes become “attached” to 
prejudice to serve as rationalizers? Unfortu- 
nately, the dynamics underlying this question 
have not been elucidated, 

Stereotypes and behavior. It h 
assumed that stereot: 
behavior, 
haps, 


as long been 
ypes have relevance to 
on an individual level and, per- 
on an international level also. For 
example, in the series of resolutions adopted 
by the General Conference of the United Na- 
tions Educational, Scientific, and Cultural 
Organization (UNESCO) in 1947 and 1948 
in connection with the Project on Tensions 
Affecting International Understanding, the 
Director General was authorized to promote 
a worldwide series of investigations of “na- 


tional stereotypes” (Klineberg, 1951, p. 505). 
While it seems logi 
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ticular cases. Unfortunately, data on these 
matters are not readily obtainable. c 

There is a similar paucity of data pertain- | 
ing to the relationship between the stereotypes 
of individuals and discriminatory behavior 
toward members of the ethnic group. On the 
one hand, researchers interested in stereotypes 
usually have not employed behavioral mea- 
sures. On the other hand, researchers con- 
cerned with actual behavior toward ethnic 
group members, typically Negroes, have chars 
acteristically employed only broad “attitude 
or “prejudice” measures, whose relationship 
to stereotyping is unclear. 

One attempt to assess the relevance of 
stereotypes to behavior (or, more specifically, 
to evaluations of individuals) has been made 
by Mann (1967a). He attempted to find out 
whether, when persons had attributed certain 
prevalences of traits to ethnic groups as a 
whole, their evaluations of specific ethnic 
group members would be consistent with the 
magnitude of their trait attributions for the 
entire group. 

Mann hypothesized that it should be rela- 
tively “easy” for individuals to stigmatize an 
entire group as an “abstract mass.” However, 
this blanket attribution of qualities would be 
more difficult in concrete individual cases. The 
outcome could be a readiness to see the ethnic 
group as “bad” but the individual ethnic 
group member as “not bad,” even if the 
asserted prevalence of badness in the group 
would make it improbable that an individual 
group member lacks badness, 

Working with white subjects in the Union 
of South Africa, Mann found that when it 
came to attributing traits to specific indi- 
viduals within the ethnic groups, no subject 
was inconsistent with his generalization when 
it had been exceptionless (10095).* For those 


* Mann reported that 17% of the generalizations 
made by his Subjects were exceptionless, that 15, 
100%. It could be expected that the prevalence of 
exceptionless generalizations about ethnic groups in 
the United States would be considerably less, con- 
trasting the racial climates in the two nations. In 
fact, research Carried out by this writer indicated 
that for United States college students in a some- 
what similar task, the percentage of generalizations 
that were exceptionless (100%) was 2.2%. Viewe! 
in another way, 42.5% of the subjects circled at 


least one “100%” in the 120 percentage-circlin£? 
they made, 
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subjects whose generalizations had not been 
exceptionless (20%, 40%, 60%, or 80% of 
the ethnic group supposedly possessed the 
given trait), however, about two-thirds of the 
subjects showed some inconsistencies. As 
would be expected, most of these inconsisten- 
cies were for situations in which 40% or 60% 
of the group was estimated to have a particu- 
lar trait. Contrary to Mann’s hypothesis, 
however, most of the inconsistencies were ones 
in which a random individual was assigned 
a trait (usually negative) that the subject 
had previously assigned to a minority of the 
group. Only about half as many subjects 
denied the presence of negative traits in 
tandom individuals from a group to which 
they had assigned a high prevalence (60% or 
80%) of the trait. 

Brigham (1969) attempted 
Vestigation of the extent to w 
generalizations are significant factors in the 
evaluation of, and behavioral intentions 
toward, ethnic group members (Negroes). 
He found that some of the percentages 
circled regarding the prevalence of traits in 
“Negroes” as a group were significantly re- 
lated to racial attitude and to average evalu- 
ations made of specific Negroes. However, 
generalizations made about “Negroes” on 
Specific traits were not generally related any 
More strongly to evaluations of individual 
Negroes on that same trait than to evaluations 
Of the individuals on other traits of similar 
affective loading. But while racial attitude 
showed a small significant relationship to be- 

avioral intentions toward individual Negroes, 
an overall stereotyping measure showed no 
relationship whatsoever (r = .04) to behav- 
ioral intentions. This stereotyping score, how- 
Ever, was significantly related to the evalua- 
tion of individual Negroes, which; in turn, Was 
Significantly related to behavioral intentions 
toward the individual Negroes. . 

On the basis of the above two studies, at 
least, it would seem evident that there is 
Not a simple relationship between th 
Sion of ethnic stereotypes and their 

thavior toward specific ethnic group mem- 
ers, Intrapersonal and situational conditions, 
es which such “use” may or may not 
in e, have yet to be specified. Further, 
gs us little development of assessmen 
es for determining whether such 


a further in- 
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takes place. Until considerable research con- 
cerning these issues is carried out, discussion 
of the stereotype-behavior relationship will 
have to remain at a largely speculative level. 


THE PRESENT STATE OF KNOWLEDGE 


In spite of the volume of research on 
ethnic stereotypes that has been carried out 
for the last 40 years, there seem to be rela- 
tively few “advances in knowledge” that 
have been made about stereotypes and stereo- 
typing. We know that subjects, when placed 
in the appropriate situation (ie., the Katz 
and Braly paradigm), will produce patterns 
of trait assignments to ethnic groups, which 
are similar in differing areas of the United 
States and, to a lesser extent, throughout the 
world. The patterns of trait assignments are 
also fairly consistent over time (Gilbert, 
1951; Karlins et al., 1969) although there 
may have been a general weakening of 
consensus in recent years. 

Although this consensus on attributed traits 
may be of interest from a sociological or 
statistical standpoint, it tells us very little 
about stereotypes or stereotyping. In the first 
place, the methodological weaknesses of the 
research paradigm make interpretations of 
the meaning of the experimental results tenu- 
ous at best. For example, the consensus that 
has been found appears to be, at least in 
part, a function of the methodology used, as 
Brown (1965) and Eysenck and Crown 
(1948) have pointed out. Second, the results 
that are observed seem to have significance 
only as a "measure of social sharedness, 
rather than of stereotypy [Scott, 1965, p. 
94]." Findings as to the extent that a given 
generalization is widespread among a group of 
people (subjects), even if methodologioal 
problems are ignored, do not prove of any 
value in elucidating what a stereotype is, how 
they are developed, or even how many people 
hold a stereotype. For example, how many 
subjects actually endorse (agree with) or 
use the generalization in behavior, and how 
many subjects are just reporting on their 
k aowledge of the traits that persons in one's 
ic = amonly attribute to the given 


most com 
eio oui (cf. Karlins et al., 1969, p. 15)? 


The magnitude of generalizations has also 
overlooked. Since most research 
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that are “most typical” of an ethnic group, 
we do not know whether the subject believes 
that a trait which is “typical” is found in 
20%, 50%, 90%, or all of the members of an 
ethnic group.* While some theorists have ap- 
peared to assume that the generalizations in- 
volved in stereotyping are virtually exception- 
less, recent research (Brigham, 1969; Mann, 
1967) has suggested that  exceptionless 
generalizations are quite rare. 

The psychological processes underlying 
stereotyping have been largely bypassed in 
the research literature. The mechanisms of se- 
lective perception, projection, rationalization, 
and denial are most often used in descriptions 
of stereotyping, while stereotyping is tied 
in with the general personality characteristics 
of ethnocentrism, authoritarianism (Adorno 
et al, 1950), and concreteness of thinking 
(White & Harvey, 1965). But little has been 
done toward the construction of a unified 
theory of stereotyping. Research into the pos- 
sible functional values of stereotyping and 
its implications for differential success of 
stereotype-change attempts has received little 
attention. 

For instance it may be that for stereo- 
types, as well as for generally negative atti- 
tudes, the functional bases may diffe 
different individuals 
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concerning the functional characteristics of 
ereotyping. 
P Mane researchers have proposed (with an 
almost audible sigh): “Stereotyping is uni- 
versal among all members of the human race 
[Bogardus, 1950, p. 289],” but most writers 
agree that stereotypes are undesirable and 
should be eradicated. However, there is little 
agreement concerning just what is the facet 
of stereotypes that makes them undesirable 
(e.g, inaccuracy, faulty thought processes, 
unacceptable basis of acquisition, rigidity; 
ethnocentrism, etc.). This lack of agreement, 
of course, is tied in with the similar lack of 
agreement as to what stereotypes are. 
Research concerning the development of 
generalizations about ethnic groups, their 
rigidity in the face of contradictory evidence, 
and especially the influence of these generali- 
zations on actual intergroup behavior has been 
infrequent and hampered by differing con- 
ceptualizations of the phenomena under 
investigation. Such research as has been car- 
ried out has failed to identify any clear-cut 
stereotype-behavior relationship. Finally, 
while the distinction between "social shared- 


ness" or "cultural consensus" and "personal 
stereotype" has been recognized by several 


authors, this distinction has made little impact 
on the research literature. 

Despite this present state of affairs, which 
would appear to be rather bleak in terms of 
level of knowledge attained, most researchers 
seem to share the view that the concepts of 
stereotype and stereotyping nevertheless can 
be of considerable value and importance in 
the understanding of human behavior. Since 
the present writer shares this view also, it 
would seem germane to attempt a reconceptu- 
alization and reformulation of the concept, 
both in terms of conceptual clarity and i? 
terms of potential for research. 


Erunic STEREOTYPES: A MODIFIED 
CONCEPTUAL FRAMEWORK 

There seems to be 

Psychologists that a 

zation (or belief) 

a generalization 

prevalence of a gi 


general agreement among 
stereotype is a generali” 
about a group; most likely: 
Concerning the perceive 
ven trait within an ethnié 
group. The term is also often used in a mov 


ETHNIC STEREOTYPES 31 


general sense to describe a particular pattern 
of traits that is commonly assigned to an 
ethnic group. To avoid confusion, it might 
be helpful to refer to the latter case as a 
general or overall “group stereotype,” re- 
serving the term “stereotype” to denote a 
single trait attribution. Certainly, if one 
wishes to categorize people as to whether or 
not they fend to stereotype, then attention 
would be given to patterns of trait attribution 
that they might. endorse. 

Although it seems clear that a stereotype is 
a type of generalization, it seems equally clear 
that not all generalizations are stereotypes. 
Surely, “stereotype” is not simply a synonym 
for "generalization"; the former term must 
have meaning above and beyond that attrib- 
uted to the latter. What is it, then, that dif- 
ferentiates a stereotype from an “ordinary” 
generalization? Or, more simply, what differ- 
entiates a “good” generalization from a “bad” 
generalization (i.e. stereotype)? Since none 
of the currently available conceptualizations 
seems completely adequate, it must be the 
case that some aspect of the use of stereo- 
type as a concept is being overlooked. If 
attention is given to those aspects of stereo- 
types that seem objectionable, a more ade- 
quate conceptualization might arise. 

If there is any single theme running through 
the views presented in this study, it would 
seem to be that when à person's belief is de- 
Scribed as a stereotype, he is being criticized 
for having failed in some Way to meet rational 
standards of objectivity. The shortcomings of 
Previous treatments of stereotype involve the 
assumption that the attributed lack of objec- 
tivity is always of one type—namely, that 
of accepting a generalization that is false, 
or is unsupported by available data, or 1S 
based upon illogical reasoning, or is held with 
inordinate rigidity, and so forth. 

The endorsement of a stereoty 
seen as involving a number of different errors. 
Perhaps the term most adequate for capturing 
the nature of these errors is unjustified. This 
term can have two distinct senses, both of 
Which may be relevant. On the one hand, as 
Suggested above, a generalization can be seen 
as unjustified when it does not correspond to 
the facts, or when one has not gathered suf- 


pe can be 


ficient evidence to support it. On the other 
hand, stereotypes may enter into verbal be- 
havior as justifications for prejudicial or dis- 
criminatory social practices, which them- 
selves may be viewed as unjustified. It is 
this latter use to which our attention has been 
called by the notion of “white racism” (e.g., 
Kerner et al., 1968). When practices of dis- 
crimination are all-pervasive, justification for 
such behavior is usually not necessary. How- 
ever, as Brown, Campbell, and others have 
pointed out, should these practices be criti- 
cized, then such generalizations may be avail- 
able to enable one to consider the discrimina- 
tory practices to be based upon natural, 
innate, or irreversible intergroup differences. 
One facet of the use of this concept which 
has not been commented on is the fact that 
the term is usually applied by someone to 
someone else’s generalization. That is, one 
does not typically label his own beliefs as 
stereotypes; the label is generally affixed by 
someone else who is observing, interpreting, 
or describing a person’s beliefs. The term 
stereotype, therefore, is generally attributed 
to a person's beliefs by someone else who is 
ay observing the situation, and the 
criteria used in classing the beliefs as stereo- 
types are those of the observer. It is the 
observer who makes the decision that the 
beliefs do not meet his own criteria of ac- 
ceptability and hence are stereotypes. (In a 
similar manner, certain negative attitudes may 
be labeled “prejudicial” by an observer of 
this situation.) 
The concept of stereotype may therefore 


be defined in the following manner: An ethnic 


stereotype is @ generalization made about an 


ethnic group, concerning @ trait attribution, 
which is considered to be unjustified by an 
observer. It is this observer who will decide 
that the generalization is or is not a stereo- 
ccording to his criteria of justifiability. 
f the theorizing and research 
reviewed herein, these observers have been 
psychologists or sociologists. Viewed under 
this frame of reference, then, the major short- 
coming of most of the published “stereotype” 
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the criteria under which they are considering 
the generalizations unjustified. And, in those 
cases where criteria have been described, such 
as factual incorrectness, representations of 
inferior judgmental processes, or justifications 
for discriminatory behavior, it has usually 
been left unsaid that these criteria remain 
assumptions on the part of the particular 
researchers, rather than measurable attributes 
of the situation." 

Some writers, for example, have proposed 
that the high intersubject agreement on the 
attribution of certain traits to certain ethnic 
groups provides evidence that these attribu- 
tions are invalid. Why should this be so? It 
Seems to this writer that the most reasonable 
interpretation of this approach would be that 
the researchers (observers) are assuming that 


such modal trait attributions must have been 
acquired from hearsay (or from an inadequate 
basis of acquisition) 


rather than through an 
“acceptable” learning situation, Yet, with a 
few exceptions (e.g., Brown, 1965; Klineberg, 
1951) this latter point has not been made. 
Even in cases where this approach has been 
formally stated, furthermore, writers have 
failed to note that until data bearing on 
the basis of acquisition are collected, such 


an interpretation must be regarded as an 
assumption or a working hypothesis, 
The chief 
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Implications for Research 


Measurement of stereotypes. Under the 
Katz and Braly format, a stereotype of an 
ethnic group has been treated as the traits 
most commonly attributed to that group by a 
sample of subjects. However, analysis of indi- 
viduals and the particular traits that they 
attribute has seldom been attempted. It ap- 
pears that most researchers have assumed 
that the more an individual agrees with the 
trait attributions made by the entire sample, 
the more he is giving evidence of stereo- 
typing. The rationale underlying this assump- 
tion may be that agreement with modal trait 
attributions is seen as presumptive evidence 
that such views have been obtained from an 
unacceptable source, for instance, hearsay, 
rather than from face-to-face contact, and are 
hence unjustified. 

However, this assumption has not been 
tested. The sources from which subjects ob- 
tain their ideas about which traits to at- 
tribute to different ethnic groups have not 
been elucidated. Therefore, the implicit as- 
sumption that such trait attributions are 
somehow unjustified would seem to be tenu- 
ous at best. Even if this assumption is ac- 
cepted, furthermore, few investigators have 
been interested in assessing the degree to 
which individuals agree with these modal 
trait attributions, that is, deriving individual 
stereotyping scores. To this writer's knowl- 
edge, only one such study (Brigham, 1969) 
has been carried out under the Katz and 
Braly paradigm. 

Since this paradigm suffers from so many 
methodological shortcomings, as pointed out 
earlier, other methods of assessing subjects 
beliefs about the prevalence of traits within 
ethnic groups would certainly be welcome. 
One possible method might be the previously 
discussed technique of having subjects circle 
the percentage of the ethnic group thought to 
have each of a number of listed traits. This 
method avoids the ambiguity that seems tO 
accompany the Concept of “typical” in the 
Katz and Braly Paradigm, while the re- 
searcher’s (observer's) conception of justified 
can be inserted readily, Tt may be that thé 
researcher feels that any extreme percentage 
assigned (e.g, over 80% or under 20%) 3° 
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unjustified and hence a stereotype. Or, it 
could be proposed that any percentage de- 
viating by more than one (or two) stan- 
dard deviations from the sample mean would 
be considered unjustified and a stereotype. 
In either case, each subject can then be given 
a “stereotyping score" indicating the number 
and/or intensity of the stereotypes that he 
endorsed. j 
A modification of this approach would be 
to ask subjects to give the range of percent- 
ages which they would think are “reasonable” 
or justified" for each trait relevant to the 
ethnic group. The subjects in this case could 
be either representatives of the populations 
under investigation (e.g., college students) or 
representatives of the "observer population” 
(e.g., social psychologists). The modal range 
given by the subjects for each trait could then 
be used as criteria for the classifying of the 
responses of other subjects as justified or 
unjustified; that is, circled percentages that 
are outside of the obtained modal range for a 
particular trait would be considered unjusti- 
fied and hence a stereotype. Again, each sub- 
ject would then have a stereotyping score, 
denoting the number of trait attributions for 
which his circled percentages did not fall 
Within the (modal) range of percentages seen 
as “justified” by the previous sample. 
_ A further extension of this technique could 
involve instances in which trait attributions 
to two groups are assessed. If, in the re- 
Searcher's (observer's) opinion, these two 
Sroups should be regarded equally (for. in- 
Stance, Jews and Christians, or American 
egroes and whites), stereotyping could be 
Operationally defined as instances wherein 
trait attributions were 0f equivalent for the 
Same trait, The magnitude of an individual's 
Stereotyping could, then, be seen as à con- 
tinuous variable, representing the difference 
between the percentages he attributed to the 
two groups for the same trait(s). 
. Learning of stereotypes. As indicated ear- 
lier, very few researchers have devoted their 
mime te m ame om 
whi and the sour 
a they are learned. Although a few cross- 
m studies of the content of generaliza- 
Age Rene ur children of different ages 
een carried out (e.g., Blake & Dennis; 


1943), it would seem that studies making 
use of longitudinal data are of great impor- 
tance. In a related vein, an investigation of 
the sources that are cited, by children of 
differing ages, as the basis for their ethnic 
generalizations would be of great interest. 

We might expect that as a child grows 
older and is exposed to an ever-widening 
sphere of social influence agents, his views 
and the sources from which he derives them 
could be considerably modified. It could be 
hypothesized that correspondence with pa- 
rental views regarding ethnic trait attributions 
might be lessened, at least in situations where 
the information imparted by other sources, 
for instance, peers, teachers, and communica- 
tions media, differs from that supplied by the 
parents. Even with relatively young children, 
questions of the *How do you know that?" 
type might provide considerable evidence 
regarding the source from which ethnic 
generalizations are learned. 

Degree of commitment and saliency. As 
many attitude theorists have pointed out, re- 
search on attitudes has typically ignored the 
or importance, of attitudes to indi- 
As noted, the conditions under which 
earned, and the conditions 
ange in an attitude, may 
great and little saliency. 
hat an attitude may 


saliency, 
viduals. 
the attitude is ] 
which may induce ch 
differ for attitudes of 


Similarly, the function t 
serve for the holder’s adjustment (Katz, 


1960) and the degree to which it is inter- 
related with other similar attitudes can 
have major consequences for the success of 
attitude-change attempts. The same possibili- 
ties exist for the analysis of stereotypes and 
their possible interrelations with other aspects 
of personality. 

A logical derivation from the research find- 
ings on attitudes toward ethnic groups would 
lead to the hypothesis that those stereotypes 
which are salient to the individual, to which 
he is committed (behaviorally, or verbally), 
that are buttressed by the endorsement of 
other similar stereotypes, and that are 
serving an ego-defensive function for the per- 
sonality, would be least amenable to change 
by rational, informational methods. 

Little or no empirical attention has been 

aid to the degree of commitment that sub- 
jects have toward the generalizations that 
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TABLE 1 
CORRELATIONS BETWEEN ATTITUDE TOWARD NEGROES 
AND TRAITS ATTRIBUTED TO NEGRO ES, PERCENT- 
AGES OF NEGROES EsTIMATED TO “Hay 
Each TRAIT, AND DIFFERENCES IN PER- 
CENTAGES ASSIGNED TO NEGROES 
AND (WHITE) AMERICANS FoR 


THE SAME TRAIT 
Katz and Braly 
paradigm 
" Percent- 
Trait — ane ae 
Percent- Signed to! minus 
age of - . | Negroes percentage 
cubjee’s| corres | See i 
attribut- attitude | Attitude | ^ oes 
ing i: X Attitude 
Intelligent 29** 24e 
Industrious oh? 3eek 
Friendly 209 22*k& 
Nationalistic -20* 03 
Religious =i 02 
ld 
.06 
a —.08 
Athletic 404 
THappy-go-lucky 24 —.17# b 
Hostile 27:5 .09 —.d44** 
Impulsive 18.5 iu 
24 
b 
— 40% 


Negroes 
traits, 


gative 


(all significance tests are 


they hold. Ni 


two-tailed), 


either the Katz an 


mity of stereotype and de- 


into account, could improve 
tion Considerably, 
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in question, little attention has been paid to 
the relationship between ethnic attitudes and 
Specific stereotypes (trait attributions). The 
results of one attempt to get at this relation- 
ship are discussed below. Subjects assigned 
traits to Negroes in the Katz and Braly para- 
digm and also assigned percentages for 15 
traits to “Negroes” and to (presumably, 
white) “Americans,” along with seven other 
ethnic groups. They were also given a 
widely validated measure of attitudes toward 
Negroes (Brigham, Woodmansee, & Cook"; 
Woodmansee & Cook, 1967). 

Correlations between the attitude measure 
and trait attributions, percentages circled, and 
"difference" percentages circled (Negro minus 
American) for the same traits are presented 
in Table 1. As can be seen, attribution of 
traits in the Katz and Braly framework was 
significantly related to racial attitude for 3 
of the 15 traits in question, while the rela- 
tionship with racial attitude was significant 
for 7 of the 15 percentages circled for Ne- 
groes, and for 6 of the 10 "difference" 
percentages. Interestingly, the 2 traits most 
often attributed to Negroes, and for which 
the mean percentages circled were highest— 
Athletic and Musical—showed no relationship 
to racial attitude. Percentages assigned to 
Negroes for all 15 traits, when placed in a 
stepwise regression analysis, yielded a multiple 
R of .52 with racial attitude, 

While this covariance betw 
tudes and beliefs about the 
traits in Negroes is certainly 1 
the data do provide informat 
degree to which the attributi 
traits covaries with attitude. 
terest would be 


een racial atti- 
prevalence of 
not surprising, 
ion about the 
on of specific 
Of further in- 
the degree to Which the over- 
all “favorability” of Specific attributed traits 
is linearly related to attitude. For instance, 
's the attribution or traits that may be 
neutral or favorable in affective tone (such as, 
Perhaps, happy-go-lucky, cheerful, sense of 
humor, pleasure-loving) an indication of a 
favorable attitude, or are such traits indica- 
live of a negative attitude when attributed to 
Certain ethnic Sroups, such as Negroes? 

1° Brigham, J. € 
S. W. Unpublished 
of verb, 
Negro 


å Woodmansee, J. J. & Cook, 
manuscript entitled “Dimensions 


al racial attitudes: Racial intermarriage and 
self-improvement.” 
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3 Stereotypes and behavior. As outlined ear- 
lier, little research has been carried out re- 
lating stereotypes to actual behavior toward 
members oí the object group. While attitudes 
have shown a small to moderate relationship 
with behavior toward group members in some 
Situations (e.g, Warner & DeFleur, 1969), 
it is an open question as to whether knowl- 
edge of a person's stereotypes allows any 
increase in predictability beyond that yielded 
by measures of general affect, that is, attitude. 
One research attempt. (Brigham, 1969) sug- 
gests that knowledge of subjects’ stereo- 
types of Negroes adds little to predictive 
Power beyond that attributable to general 
racial attitude. 

In the two studies cited wherein evaluation 
of specific object group members was assessed 
With reference to those traits included in 
individuals’ stereotypes (Brigham, 1969; 
Mann, 1967a) the results have suggested that 
Specific generalizations may not carry over 
into evaluations of ethnic group members on 
that particular trait. Instead, many subjects 
evaluated ethnic group members in a manner 
inconsistent with the generalizations they had 
endorsed about the ethnic group. The implica- 
tions of such findings are not clear. 

As stated near the beginning of this paper, 


this writer shares the conviction that ethnic 


Stereotypes may serve as significant determi- 
c group mem- 


nants of behavior toward ethni 
bers, Yet, on the basis of data available thus 
far, such a conviction must remain only a hy- 
Dothesis. It seems at least possible that much 
Of the ambiguity in this area can be resolved 
With the development of more sophisticated 
instruments for assessing stereotypes. If in- 
struments are developed that can measure 
saliency of the stereotype for the subject, and 
the certainty with which he holds it, for ex- 
ample, predictive power may increase. Natu- 
rally, consideration must also be given to the 
situational and personality variables that 
bear on any behavioral situation. With the 
development of better assessment procedures 
and the ability to control or at least specify 
the effects of various situational and person- 
ality factors, the role of ethnic stereotypes 
s determinants of evaluations of, and behav- 
T toward, ethnic group members can be 
etter elucidated. 
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MEASURES OF CATEGORY CLUSTERING IN FREE RECALL! 
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A major problem in the study of organization in memory has been the failure to 
agree upon an adequate measure of category clustering. À Z-score measure, based 
on a test of the number of runs of similar items in the recall list, is proposed. Several 


ncluding some th 


alternative measures, 
reviewed and their inadequacies discuss 
measures is made using data obtained 
viously published developmental study. 
widely in the r 
cept the null hypothesis w 
certain measures are biased to increa 


lengths. 


j Since its inception, the study of organization 
in the free recall paradigm has sought to 
account for the fact that the amount of cluster- 
ing usually increases with the number of items 
recalled (Bousfield, 1953; Shuell, 1969). 

A recurring problem in this field has been 
failure to agree on an adequate measure of clus- 
tering. The current review proposes a more 
adequate measure of clustering than any sug- 
Sested thus far, and demonstrates the short- 
Comings of other measures. 

There are certain minimal requirements that 
E^ have sought in a measure for the degree of 
Category clustering: 

: should be independent of 
on Phenomena that it may be used to explain 
eit Tn particular, the clustering waren 
nu St not increase simply as a function of the 

Imber of items in the recall list. 

2. The measure should adequately ac 
ot chance. First, the constitution of the 
"e should be considered: Items from a gut 
M cd are more likely to be recalled succes- 

ely if there is only 1 item from another cate- 
Sory than if there are 10 items from other 
ng on the recall list. Second, 


l. The measure 


count 
recall 


Ca i 
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nitude of effects they yield; ( 
hen the Z score indicates a significant difference; (c) 


ase solely as a function of longer recall list 


e widely used in the literature, are 


ed. A comparison between the various 
from a simulated experiment and a pre- 
It is concluded: (a) The measures differ 


b) alternative measures tend to ac- 


statistical reliability should increase with in- 
creasing numbers of items recalled; maximal 
clustering for a large recall list should be given 
more weight than that for a smaller list. 

3. The clustering score should mean the 
same thing, regardless of the parametric values 
that yielded it. For example, a difference score 
has been devised in which the degree of cluster- 
ing expected by chance is subtracted from the 
observed amount of clustering on a recall list 
(see below). The question is, Does a difference 
score of (say) 2 have an unambiguous meaning 
even though it may result from an expected 
value of 7 and an observed value of 9, or an 
expected value of 2 and an observed value of 
4? 

The Runs Test as a Measure of Clustering 

A convenient way to represent the degree of 
clustering in a recall list, which is in accord 
with these principles, is to express clustering in 
terms of the number of runs of items from a 
gory present in the recall list. The 
a run is defined as the number of 
recalled items from the same cate- 
gory, regarding single items as runs of one. The 
number of runs is one more than the number of 
times during recall that the category of the 
items being recalled changes. For example, 
suppose that there are four categories recalled 
in a given trial: 4 b, c, and d. The recall list 
aaabcdcc would have @ total of five runs starting 
ith a run of three and ending with a run of 


given cate; 
length of 
successively 


WwW 


"Mood (1940) has provide 


M „and variance, 
a randomly selecte 


d the statistics for 
ean. V,, for the number 
Hem d list of arbitrary 


of runs in 


407 e 


5 5 ve adapted the 
length JV and category composition W;. Wallis and Roberts (1957, p. 571) have adapte 
Pisas to a form useful for the present problem: 


N(N+1)- ZN 


M, = 
and 


V.- 


FREDERICK FRANKEL AND MICHAEL COLE 


X NY NY + NW +1) - S. NP- 
3 j 


j [1 
N > 


for a recalled list of N items, with JV; items 
present from the jth category. 

The mean number of runs 
distributed so that knowing 
observed number of runs (0. 
be calculated: 


» M, is normally 
M, V, and the 
), a Z score can 


Z m =M; 
NV, 


and reference can be made to a normal distri- 


bution table for the approximate probability 
that any individual clustering Z score is a 
Chance occurrence. We define clustering as the 
Presence of significantly “too few” runs. A 
significant positive Z score is the convention 
adopted here for simplicity. 
This procedure is easily adapted to calcula- 
tion of the degree of clustering observed in a 
group’s recall. Because we are operating with 
Z scores, this formula reduces to TA w 
n is the number of subjects in the 
Z is the mean Z Score for the group. 


This measure has several advantageous 
Properties. First, it takes 


here 
group and 


ple, again 
d to represent 


Nis, but the 
j$ has not been 

ata reported below 
mption appear reasonable for the kinds 
dies, 


p 
N*(N — 1) , 


nique used here can be applied to lists in whi 
items are repeated or intruded.) Another n 
list, aaabbaba, having the same N and On 
yields a Z score of — .21 (— -25/1.2). -. 
~ Second, perfect clustering is given ne 
Scores with higher JV: the recall list aa lis 
bbbbbcccccddddd yields a Z = 6.75; the list 
aaabbbecdd yields a Z = 3.92; and the E. 
aaaabbbb yields a Z = 2.29. All three isis M 
perfectly clustered. The Z-score differen y 
arise from a statistical consideration: i 
(Formula 2), having V(V — 1) in the denon 
nator, decreases with increasing N. d 
"Third, values generated by different Vs we 
Njs are comparable to one another becaus 
they are obtained in relation to the same, ao 
mal distribution, thereby removing any lengt E 
related bias. As the following examples illus 
trate, the relation between degree of clustering 
and N is by no means intuitive (note that T. 
recall list having a. Z score greater than 1. 
reflects statistically significant clustering, 3) 
< 05): abcedd (Z = 1.91), abbccdd (Z = 2.03)1 
and aabccccbddcd. (Z = 1.97). 2 
We will now show that all of these properties 
are not present in any one of several alternativ 
measures of clustering. 


Alternative Measures of Clustering 


Several alternative approaches to the me 
surement of clustering have appeared in E 
literature. One approach has been to uw 
Clustering in terms of the number of pairs 


25 u2) 
Successive items from the same category CN P 
that occur during recall. 


: iag ; ru 
The number of pairs within any given rU 


has to be one less than the length of that ru” 


Comparison of these measures is facilitated P. 
the following notation: 


> 


NP = 2, Gn; — 1) = 2: = 2; 1, p 
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where ni; is the number of items in the ith run, 
and & is the number of runs. Noting that 
2 m;— N, Y;1 — h, and substituting into 
i i 


Formula 3, it can be seen that the relation of 
the number of runs to the number of pairs is 
given by the formula 


k=N—NP. [t] 
Ratios involving the number of pairs as a 
measure of clustering are quite widely used 
(Cohen & Bousfield, 1956; Lambert, Ignatow, 
& Krauthamer, 1968). A score in terms of 
these measures represents the number of pairs 
Of clustered items per word recalled. (LR 
— NP/N) in Lambert's measure and per word 
minus 1 (RR = NP/N — 1) in Bousfield’s 
measure, Another similar approach is to count 
the total number of items appearing in runs 
larger than 1 and divide by N, yielding a per- 
centage in clusters (PC) measure. - 

Shuell (1969) has suggested that this division 
by N (or N — 1) serves to make the measure 
independent of V by “partialling it out.” 

All three of these measures share one serious 
shortcoming: They fail to account for the cate- 
gorical constitution of the recall list, and thus 
the chance likelihood of a given score becomes 
indeterminate. To take a simple case, if a sub- 
ject recalls four items with only one pair of 
clustered items, he would receive scores of .33 
on RR, .25 on LR, and .50 on PC. However, if 
three of the items were from the same category, 
the event (one pair) is much more likely than 
if the items were from three different cate- 
Eories, where a maximum of one pair is possible. 

This shortcoming has been recognized by 
several authors. It has been dealt with by cal- 
Culating the expected number of pairs (E) for 
à list of N; items recalled from each of j cate- 
Sories (Bousfield & Bousfield, 1966). E is given 


by the formula! 
poe ae 


es ; [5] 
dm N 


Bousfield and Bousfield (1966) suggested 
Construction of a difference score (DS), ob- 
Served minus expected number of pars 


umber of pairs, E, can 
Formula 4 into For- 
My. 


4 "s 

Bh de expected value of the ni 

o Pe obtained by substituting 
^. 1 which was the expected number of runs; 


(VP — E), as a new measure of clustering. 
This measure has been widely used (eg., 
Dallet, 1964; Scott, 1967). As mentioned 
earlier, a given DS is ambiguous, since it may 
arise from several nonequivalent sources. Also, 
for short lists the difference between VP and 
E is of necessity small, while for longer lists it 
can be quite large. Thus as Shuell (1969) 
pointed out, mediocre clustering of a large list 
may yield a larger DS than perfect clustering 
of a smaller list. Note that a larger DS given 
to a list with larger V does not arise from the 
statistical consideration mentioned earlier. 
After reviewing some of the inadequacies of 
RR and DS, Shuell (1969) suggested the use of 
the maximum possible number of pairs (max) 
for a given set of .Vjs, in combination with DS. 
Maximum clustering occurs when the number 
of runs is equal to the number of categories 
recalled (i.e., all the members of the jth cate- 
gory are recalled successively). Thus, if JV; is 
the number of items recalled from the jth cate- 
gory, then (substituting into Formula 3): 


max = 25 (N; — 1). [6] 


Relevant to Shuell's suggestion is a measure 
devised by Gerjuoy and Spitz (1966), using DS 
in the numerator and max — E in the denomi- 
nator [GR = DS/(max — E)]. However, GR 
has its own shortcomings: Negative GRs are 
possible, and a GR of — .60 is not necessarily 
analogous to a GR of .60 because max — E 
does not necessarily represent the maximum 
possible negative difference (i.e., the range of 


this measure is not always — 1.00 to 1.00 as 
one might expect). For instance, with Vjs oe 
1, 1, and NP =0, GR yields a — 1:5: 
Roenker, Thompson, and Brown (1971) have 
pointed out that the range of GR is invariant 
with list size. Although at first this invariance 
might seem a virtue, it means that the measure 
does not take into account the increased reli- 


ability of observations made on longer recall 


lists. 
One approach has n 
which might be conce 
ined distri- 
actice, we have found that the obtaine i 
i a px for GR from actual experiments is 
n iy negatively skewed (e.g., when applied to the 
PETO. et al., 1970, Experiment ID, violating the 
ew i assumption of analysis of variance. 
norma 


not yet been considered 
ved as similar to the Z 
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score. Bousfield (1953) has suggested the Index 
of Repetition, based on the normal approxima- 
tion to the binomial distribution. This measure 
has rarely been used, even by Bousfield. As 
Shuell (1969) pointed out, use of the measure 
requires two controversial assumptions which 
few investigators are willing to accept: First, 
the whole list of items is assumed to be avail- 
able to a subject whose recall contains only a 
subset of the total. Second, it is assumed that 
items recalled are sampled from the w 
with replacement. 

Since these assumptions do not 


existing data, this measure is not 
further. 


hole list 


agree with 
considered 


Comparison of the Various Measures 


Shuell (1969) calculated correlations between 
several different meas 


ures of clustering using 
data from an experiment on college students, 
The correlations obtained were very high, thus 
giving the impression th 
pared were generall 
one another, However, 
were relatively consisting only of 
college students, and only within-trial compari- 
ese factors limit the 
er of items recalled and 
use this particular sub- 
9 organize their recall to 
own, the number recalled 
he recall list inde- 


of the 


à more sensitive 
of various clustering 
methods of compari- 


the Purpose of det 


atically 
the particular 
ed by a y 

arbitrarily 
items each, 
all lists of 5, 10, 
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15, and 20 items, respectively, which pe 
chosen at random from the pool of 20. C usi 
ing scores for each of the various piger ied 
cussed in the previous section were calcula : 
for all subjects, and analyses of variance Me 
performed. The group means for two of E " 
measures, LR and PC, were found to mno 
significantly with increasing V (F = 5.70 F. 
4.70, respectively, df = 3/597, p < 01). m 
other measure was found significant (F < D); 

Why doesn’t RR also show this bias? The 
answer lies in the fact that for RR, 1 is sub- 
tracted from the number recalled in th 
denominator. This has the effect of a 
the clustering scores of recall lists with lon 
Ns proportionally more than scores of UN 
lists with higher Ys (the difference yo 5) 
and V — 1is relatively larger for smaller : 1 

In order to gauge the effects of list compong 
tion on the comparability between the er. 
measures, each kind of score was. calcula " 
using data obtained in an experiment P 
viously conducted by the authors (Cong / 
Frankel, & Sharp, 1971). This Oxperuuenn "i 
especially useful for present purposes, becat i 
a relatively wide variety of espaimar i 
manipulations and subject populations M 
involved, Providing a good range in vanae 
both V and composition. Briefly, the variab a 
involved were grade in school, order of stim "C 
lus Presentation, and type of stimuli. TIV 
trials were given to each subject. E 

The groups differed significantly with ed 
to N for all main effects, and with respect 


` " the 
the Z score for all main effects and four of th 
interactions. 


The results of analysis of variance performed 
on the other measures, in terms of the relativ? 
amount of difference between conditions est 
mated by each of the measures, are presente 
in Table 1, 

Several conclusions can be drawn from oe 
Parisons of the Proportions in Table 1. wi 
the Manipulations Produced large difference 
in the number recalled (V) as in the Grad 
Effect, all measures yielded similar conclusion? 
However, the relative amount of an increase " 
clustering as a function of Grade, estimated by 


* We also note 


d that RR had 
mean of .21, 


E ance 
a relatively high cha! 


MEASURES OF CATEGORY CLUSTERING 43 


TABLE 1 


PERCENTAGE INCREASE IN THE MEA 


S OBTAINED FROM THE CLUSTERING MEASURES FOR THE 


DEVELOPMENTAL SrUDY FOR SELECTED EFFECTS 


Effect PR LR PC DS GR Z score N 
à 1 
Grade (G) 53.7** 7 
Gr ; 53. 63.0** 374** | 24L5** + 7 
Stimulus order (So) 26.9** 26.7** 23.4** En e -39" OM 
Stimulus (5) 29.5** 5289" | 23.0 | 976 | 49.5% | 796" p 
Trials 21.6** 26.6** 1125* | 116.6** 17.0** 73.9% 43.5** 
Gxs 150.7* 1244 362 164.7** 384** | 106.5* 33.1 
oxS loo | —454* | —480** | —4L2* | —486 126* | —612 

* p «05. 

pe OL 


each measure, varies radically: DS estimates a 
difference 10 times greater than that estimated 
by the Z score. 

When effects were of smaller magnitude, and 
the corresponding F ratios smaller, the conclu- 
sions reached concerning the various clustering 
Measures differed greatly. For instance, RR 
and the Z score yielded ps less than .05 for the 
Grade x type-of-stimulus interaction, which 
would lead us to conclude that the magnitude 
of this variable was greater for the higher 
Grades, The effect would not be found signifi- 
cant if LR, PC, or GR were used (F = 2.96, 
51, and .46, respectively) and would be over- 
estimated if DS were used (F = 6.56, P < .01). 

In general, the alternative measures accepted 
the null hypothesis in marginal cases when the 
Z score found it to be false. 


Summary and Discussion 
res of categorical clus- 


This review of measu 
ard measures are 


tering indicates that stand 
misleading for a variety of reasons. RR, LR, 
and PC fail to take account of the categorical 
Makeup of a given recall list, an important 
determinant of the chance likelihood of obtain- 
ing a specified run length. PC fails to account 
Or lesser likelihood of larger runs; LR and PC 
Were found to be biased to increase with in- 
Steasing numbers of items recalled. Perhaps 
With larger list lengths (as those more com- 
monly used), RR too would be found to be 
ased, 
e earlier solutions designed to account 
io Chance likelihood of a given clustering 
€ based on Æ, the expected number of pairs 


int for 


in a list of arbitrary composition, were found 
inadequate for the present problem. The differ- 
ence score derived from Æ, like the scores which 
fail to consider chance levels of clustering, has 
different meaning when obtained from recall 
list of varying lengths and composition leading 
to overestimates for longer list lengths and 
certain list compositions. Inclusion of intuitive 
normalizing operations such as max — Æ (as 
in GR) in the calculation of the difference score 
is at best a rough solution, since max — Æ in 
the denominator is only loosely related to the 
theoretically appropriate denominator, and 
does not take statistical reliability into account. 

By applying the theory of runs to the present 
problem it was shown that a Z-score measure 
can be derived which is not subject to the diffi- 
culties inherent in other approaches and which 
has been shown empirically to be more sensitive 
than the other alternatives discussed. 
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COMPARISON OF MEASURES FOR THE ESTIMATION OF 
CLUSTERING IN FREE RECALL? 
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Recently developed measures of clustering in free recall are examined and shown to 
vary with characteristics of recall unrelated to relative amounts of clustering. 


Specifically, previous me: 


categories recalled, the distribution of the total items recalled across categories, 


asures were shown to vary with such factors as number of 


and 


total number of items recalled. For these reasons, comparisons between and within 


subjects, as well as comparisons between experiment 
justed ratio of clustering (ARC) measure, 


characteristics of recall, is proposed. 


The literature in recent years has shown a 
dramatic increase in studies of clustering in free 
recall stemming from recognition of the im- 
portance of these studies for understanding 
Organizational processes in memory. Accom- 
panying the development of this line of re- 
Search has been an increasing sophistication 
t of clustering. The relative 
any of the 
tensively 


in the measuremen 
advantages and limitations of m 
earlier clustering indexes have been ex 
discussed by Shuell (1969). This article con- 
cerns itself with an evaluation of four of the 
more recently developed measures and pro- 
Poses another index of clustering in free recall. 


Limitations of Present Measures of Clustering 


Two of the measures considered here, the 


Modified ratio of repetition (MRR; Bower, 
Lesgold, & Tieman, 1969) and the clustering 
(C) index (Dalrymple-Alford, 1970), represent 
Modifications of the original ratio of repetition 
Measure developed by Bousfield (1953). The 
Other two, the Bousfield and Bousfield (1966) 
deviation (BBD) measure and the deviation 
(D) index (Dalrymple-Alford, 1970), were 
designed to measure amount of clustering 
relative to chance. These four measures suffer 
from two types of limitation in that (a) 


ere is no fixed upper bound for D and BBD; 
repre- 


and (b) the same MRR or C score may 
Sent clustering above, at, Or below chance 
a cause chance is not set at zero. More specifi- 
ally, the score for perfect (i.e, maximum 
Public 
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are most difficult. An ad- 
which does not vary with irrelevant 


possible) clustering in the case of D and BBD, 
and the score representing chance clustering 
in the case of MRR and C, changes with (a) 
tHe number of categories that the subject re- 
calls, and (^) the distribution of the total items 
recalled across categories (i.e., given that a 
subject recalls 15 items representing 3 cate- 
gories, the distribution of items across cate- 
gories could be 5-5-5, 3-5-7, 1-1-13, etc.). 
It should also be noted that the BBD measure 
is affected by a third factor, the total number 
of items recalled. 

Because these measures do not reflect the 
actual category repetitions above 
chance to the total possible category repetitions 
above chance (hereafter referred to as relative 
amount of clustering), it is difficult to make 
comparisons between experiments (or be- 
tween subjects). Equally important, it is 
difficult to evaluate the relative amount of 
er trials for a single subject. 
For one set of measures (D and BBD), a 
score indicates clustering above 
but it is impossible to determine 
whether the score reflects perfect or less than 
perfect clustering. Similarly, for the other set 
of measures (C and MRR), it is also difficult 
to ascertain the relative amount of clustering 
since chance may be represented by different 
scores. These limitations point to the need for 
h will allow for comparison of 
s of clustering in different 


proportion of 


clustering ov 


positive 
chance, 


a measure whicl 
relative amount: 


experimental situations. 


alio of Clustering (ARC) Score 
tering which meets these 
been developed in our 


The Adjusted R 


An index of clus 


requirements has 


wo 
laboratory.” This is an adjusted ratio of cluster- 
ing (ARC) score in which chance clustering is 
Set at zero and perfect clustering at one. In 
other words, the ARC score represents the 
proportion of actual category repetitions above 
chance to the total possible category repetitions 
above chance for any given recall protocol. 
Hence, the 4 RC score is invariant with respect 
to factors unrelated to relative amount of 
clustering. The computational formul 


a for the 
ARC score is as follows: 


; R— E(R) 1 
ARE = maxR — E(R) [ 
where 
R = total number of observed category 
repetitions (ie. the number of 
times a category item follows an 
item from the same category), 
maxR = maximum possible number of cate- 
gory repetitions, and 
E(R) = 


expected (chance) number of cate- 
gory repetitions, 
It should be noted that 


maxR = N — k [2] 
f items recalled, and 
ategories represented in the 
And (from Bousfield & Bous- 


where V = total number o; 
k = number of c 
recall protocol, 
field, 1966) 


[3] 


ems recalled from 
as before. 


where n; = number of it 


Category i, and N is 


Comparison of Clustering Measures 


This section uses illustrative recall protocols 
to d 


lity of the 


measures are given 
ave been written in a 
m t to facilitate clarity of 
exposition, 7 


BBD = R E(R) [4] 


ript, it was 
had previously 
stering. However, 
ail the rationale for 
asure nor comment on its 
Sures of clustering. 


3 Following Preparation 
learned that Gerjouy and 
formulated the Same m 
these authors did not qj 
the development of th 
merits relative to othe 


of this manusc 
Spitz (1966) 
CaSure of clu 
Scuss in det 
e me; 
T mea: 
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TABLE 1 
Scores REPRESENTING CLUS 


G MIDWAY BE 
SRING FOR EACH 
SURES FoR Eacm or EIGHT 
ED RECALL PROTOCOLS 


Protocol ARC | MRR [^ D BBD 
30 | a | or | 33 50 
50 67 (67 38 | i 
49 73 E 32 Ee 
50 E 67 33 1500 
50 58 .58 42 5j 
50 | .58 | 58 | 42 209) 
EU E 07 32 d 
.50 .65 .65 | 33 Li. 


altw 


7 E ring to 9 
not possible to adjust the amount of Ean 
cy midway between chance and perfect ¢ 
ries. 


R [5] 
ee maxR 
R — mink 6] 
c= maxR — ming [ 


R= E(R) t7] 

maxR — minR 

where R, maxR, and Æ(R) are as before, and 

minR = minimum possible number of cate 

gory repetitions. mai 
It should be noted that (from Dalrymy 

Alford, 1970) 


fo if V+ 12> 2m s] 
minR = 4 cs 

IE otherwise 
where m = number of items of the type most 
numerous, and NV is as before, " 

A series of hypothetical recall protoco? 
Tepresenting clustering. performance e. 
between chance (as defined by Equation ~ 
and perfect clustering (i.e, maxR) are a 
below. In these Protocols, individual lette! 
(Le., a, b, c, ete.) represent individual item? 
(e.g., words), and repetitions of the same letter 
represent repetition of different items from tne 
Same category. The eight. protocols system 
atically vary with respect to number ? 
Categories that the Subject. recalls (3 or 6) 
total number of items recalled (18. or 59 
and distribution of th 
across Categories 
the 


jd 
e total items recall? 

[4 i 
(equal or unequal). Th. 
Sequence designated 3SE below is a 7 


category, short list (S; i.e., 18 items recalled) 
with an equal number of items recalled. (E) 
in each category. Similarly, the sequence 
designated 6LU is a 6-category, long list (L; 
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ie, 54 items recalled) with an unequal number 
of items recalled (U) in each category. For 
convenience in computation, maxR, “mink 
E(R), and R values are given for each seau 


3SE aaaaabacbbbbcbcecc 

R= 10 E(R)-5 maxA = 15 mink = 0 
SLE aaa aabbbbccccabaaaacccccabcbbbbbecceccaaabbbbccaabbaba 

=17 maxA = 51 mink = 0 

3SU 

R=11 E(R) = 7.1 maxR = 15 mink = 3 
3LU aaaaacbecececcabbbececebeccccbbbbeccaa: bebcececcaaaace 

R= 36 E(R) = 21 maxR = 51 mink = 5 
OSE aaaefbbbefecced (dd 

R=7 R) =2 maxR = 12 mink = 0 
6LE aaaaadedbbbcbaaaaceed bececy) jfecccccdddd ed bf f/f fbbdeb 

R28 maxk = 48 mink = 0 
6SU abeceeced fj 

R=8 =0 
6LU abbddddedccedeccecdejd f 


R=31 


m each of 


The clustering scores derived fro 
ight recall 


the five measures for cach of the e 
Protocols are given in Table 1. As can be seen, 
all measures except the -LRC measure vary 
With characteristics of recall protocols un- 
related to the relative amount of clustering. 
On the other hand, the ARC measure is 
invariant and indicates clustering midway 
between chance and. perfect performance. As 
noted previously, the vi intion in the other 
Scores results either from (a) an unfixed upper 
bound (perfect clustering score) or (b) failure 
lo adjust for chance. These sources of variation 
May be seen clearly in Table 2 which gives, 
Or each measure, the clustering scores that 
Would result if the sequencing of items within 
Cach of the eight recall protocols were re- 
Ser so that R equaled E(R) (anon T 
Tp (perfect clustering), ec e és 
ines table shows, the ARC measure is the only 
easure which both adjusts for chance and 
AS a fixed upper bound. 
Tar this point, we would like 
Pise and Cole (1971) were kind enough to 
ma available a prepublication copy of thet 
ù fe appearing in this issite. 
leve sedg | they point out what they be- 
thease, he somone of the present (A RG ) 
does hoi rey argue that a negative seare 
Score of pones same thing as d heen 
Sy me: the ENE magnitude. Apparenty, 
ample an that a negative score Of, for 
> —.50 does not mean that clustering 


to note that 


E(R) = M 


is midway between chance and minimum 
possible clustering (minR), whereas a positive 
score of .50 does mean that clustering is mid- 
wav between chance and maximum possible 
clustering (max). This is true.? However, the 
alternative would be to give unequal weighting 
to repetitions above and below chance. We 
believe that the ARC measure is more appro- 
priate because it weights equally repetitions 
above and below chance. For example, using 
the 3SE protocol, it can be shown that five 
bove chance yield a score of .50, 


repetitions al 
yield a score 


and five repetitions below chance 
of —.50. For this example, each repetition 


above and below chance receives a score of 


+.10. 
3 The reader w 
ARC score for an; 


ill note that the maximum negative 
y given protocol is: 

mink — E(R) 

maxk — E(R) 


Hence, whenever the opportunity for number of repeti- 
tions below chance is not equal to the opportunity for 
number of repetitions above chance, the maximum 
negative A RC score will not equal -L . i 
We might also note that in those instances in which 
attention is directed toward manipulations producing 
clustering below chance, 4 slight modification of the 
{RC measure can be made to yield negative ARC 
( NA RC) scores having the same properties as positive 
cores, that is, chance 15 fixed at 0 and the maximum 
Ese ‘pound is —1. The NARC measure 1$ computed 
ow 
E vs 
as follow E. a= BO 
NARC = TR) — min 
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TABLE 2 
Scores REPRESENTING CHANCE (Ch) AND PERFECT (P) CLUSTERING For EACH or Five MEASURES 
FOR EACH or EIGHT SELECTED RECALL PRorocoLs 
ARC MRR € D BBD 
^ Ch P Ch E Ch P Ch P Ch | P 
3 00 10.00 
.00 1.00 33 1.00 a3 1.00 .00 67 d 
T .00 1.00 33 1.00 33 1.00 .00 67 .00 Ee 
3SUs =:01 1.00 47 1.00 34 1.00 —.01 :66 —.10 aod 
3LU -00 1.00 41 1.00 da 1.00 .00 05 00 3 Xd 
OSE .00 1.00 17 1.00 Az 1.00 .00 83 00 1C ag 
6LE -00 1.00 17 1.00 AT | 1.00 .00 83 .00 p 
6SUs —.01 1.00 34 1.00 34 | 100 —.01 .66 —.10 7 bs 
6LU -00 1.00 29 1.00 -29 | 1.00 .00 71 .00 34. 


Frankel and Cole 
may be skewed in 
violating the norm 
of variance, How 
where that the 
tively insensitive 
(e.g., Winer, 1962 


also note that ARC scores 
their distribution, thereby 
ality assumption of analysis 
ever, it has been noted else- 
analysis of variance is rela- 
to departures from normality 


AA Furthermore, there are 
a number of transformations which may be 
used on the data to obtain normal distributions 
if desired (Burros, 1951). 

Frankel and Cole 


also propose a new measure 
of clustering, the Z 


Score. While this is one of 
the most mathematically sophisticated mea- 
Sures advanced recently, it nonetheless suffers 
Írom the same limitations of some of its 
predecessors, Specifically, it has no fixed 
upper bound, For example, using our protocols 
representing perfect clustering, the obtained 


Z scores were 5.31, 10.00, 4.93, 9.05; 7.52 
15.33, 5.03, and 11.56 à 


for protocols 3SE, 

SLE, 3SU, SLU, 6SE, GLE, 6SU, and 6LU 
respectively, i 
In conclusion, it h 


as been shown that the 
measure is 


free of the limitations of 
Previous indexes and provides an uncon- 


taminated Measure of relative amount of 
clustering in free recall, thereby allowing for 
Comparisons between and Within Subjects 

ments, However, it Should be 
> is an empirical 
and in no way presumes 


to reflect the mechanisms underlying organiza- 
tion in free recall. Until measures are de 
veloped which take such mechanisms p 
account, we recommend the use of the AR 
Score. 
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EMPIRICAL COMPARISON OF UNIVARIATE 


A? 


D MULTIVARIATE ANALYSIS OF 


VARIANCE PROCEDURES 


THOMAS J. HUMMEL! 


University of Minnesota 


AND 


JOSEPH R. SLIGO 


Ohio University 


In this Monte Carlo study, univariate and multivariate analysis of variance pro- 


cedures are used either 
analysis of multivariate norm: 
with respect to the error rates p 
average number of errors in experi 
when data generated under 36 
and proportion of varianc 
multivariate analysis of vi 
recommended as being 


A popular experimental design in educational 
and psychological research is the completely 
randomized design in which experimental 
Subjects are randomly assigned to treatment 
groups. Following the manipulation of some 
independent variable, measurements are taken 
On a single criterion, or dependent, variable. 
Phe resulting observations are frequently 
analyzed using a one-way analysis of variance. 
Tn a special case of this situation, where there 
are two treatments, the / test for two independ- 
ent samples is often employed. 

As a variation of the above procedures it is 
Common to use several criterion variables. 
Por example, after assigning some subjects to 
* group that receives reading Method I and 
Others to a group which receives reading 
Method II, an investigator may desire to 
Sather measurements such as reading speed, 
Comprehension, and retention. ; 
There are at least three approaches which 
an be employed to analyze data from a 
Multivariate experiment. One approach is to 
test each Hyg = plj = 1 to p) with a 
“Univariate analysis of variance. (In this 
Study it is assumed that there are two groups 
Measured on p variables.) This approach is 
men in the educational and psychological 

earch literature. 


[o 


author's doctoral 


up ANO 
is article is based on the first 
1gust 1969 at 


CORSO ; 
citation, which was completed Au 
Ra University, Athens, Ohio. 

Hy quests for reprints should be sent to 


di 
Thomas J. 


i" Inneso,., : A 5 p 
* €sota, Minneapolis, Minnesota 55455. 


ammel, Student Personnel Office, University of 


e in common were à 
ariance followed by univariate analyses of variance is 


the most useful of the three 
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singly or in combination to form three approaches to the 
al data. Comparisons among the approaches are made 
er comparison, experimentwise error rates, and 
nents with at least one error which were obtained 
combinations of sample size, number of variables, 


nalyzed. The approach consisting of a 


approaches. 


A second approach to the analysis of multi- 
variate data has been suggested by Cramer 
and Bock (1966). They have recommended 
that an overall, or multivariate, test be carried 
out on all the variables simultaneously by 
iesting the hypothesis of equal population 
mean vectors, /Jo:ui = us. (To this end, a 
statistic such as Hotelling's 7? could be used.) 
This hypothesis can be written as 


uu uia 
Hig: er |= ese 
Hpi Hp? 


Rejection of Ho:uı = us allows one to infer 
that uj Æ uj» is the case for at least one 
value of j. Following a rejection, Cramer and 
Bock (1966) recommend that univariate 
analyses of variance be run on each variable 
separately. 

Another approach has been suggested by 
Morrison (1967). Like Cramer and Bock 
(1966), he recommended that an overall test 
of Hui = us be carried out, but rather than 
employing univariate analyses following a 
rejection, he suggested using à simultaneous 
confidence interval procedure which was 
developed by Roy and Bose (1953). 

The first two approaches have both been 
questioned (Bock & Haggard, 1968; and 
Morrison, 1967, respectively) as being in- 
adequate because they do not proper ly fs 
count for the correlations that might exist 


among the P variables. 


p 
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What effect does the correlation among the 
variables have on tests of hvpotheses? The 
answer to this question involves error rates, 
Since there have been a number of different 
error rates defined by various authors, defini- 
tions of the two error rates which were of 
interest in this study are given here. 

Following Ryan (1959), the error rale per 
comparison is defined as the probability that 
any given comparison will be declared signifi- 
cant when in fact the null hypothesis is true 
for that comparison. The second error rate of 
interest, experimentwise error rale, is defined 
for experiments containing more than one 
comparison and is the probability that at least 
one comparison will be declared significant 
when in fact the null hypothesis is true for 
all comparisons. For these two definitions it is 
assumed that a is constant for all comparisons. 

To better understand the differences be- 
tween these error rates, one can think of an 
experiment being replicated a la rge number of 
times. For example, an experiment is carried 
out on two samples, and each subject is mea- 
sured on six variables. There are six compari- 
sons. If that experiment were repeated 1,000 
times, there would be a total of 6,000 com- 
parisons. If the experimenter knew that the 
Samples were being drawn from the 
multivariate population, he 
each time Hug = 
had been made. 


same 
would know that 
Hj» Was rejected, an error 
If 600 rejections resulted, 
the error rate per comparison would 
600/6000 = .10, 

If in 100 of these 1,0 
errors were made, and in another 100 exper 
ments two errors occurred, and in an addi- 
tional 100 a single error was made, errors would 
have been made in 300 of 1,000 experiments, 
and the experimentwise error rate would be 
300/1000 = .30. If all 600 errors had occurred 
in 100 experiments, the experimentwise error 
Tate would be -10. (Error. rates obtained in 
the manner described in this and the preceding 
Paragraph would represent empirical realiz 
tons of the two Probabilities defined 

Returning now 
that correl 
on testing 


be 


00 experiments three 


a- 
above.) 
now to the question of the effect 
ation among the p variables has 
hypotheses concerning group dif- 
ferences and recalling that a major function 
of inferential Statistics is 4o estimate the 
Probability of e 


the previou 
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following: What effect does the dependency 
among the variables have on error rates? 
If the variables are correlated, the comparisons 


based on these variables will not be independ- 


ent. When comparisons are independent it 15 
a rather casy task to obtain the probability 
of encountering an error in a comparison or an 
experiment. The former probability equals a, 
and the latter is obtainable from 1 — (1 — a)’. 
Whether or not the comparisons are indepen- 
dent does not affect the error rate per com- 
parison (Ryan, 1959), but dependency may 
affect the experimentwise error rate, When 
univariate analyses are the only technique 
used to analy multivariate experiments, 
the experimentwise error rate is affected and 
would seldom equal 1 — (1 — a)”. The actual 
probability would usually be unknown (Bock 
& Haggard, 1968). 

A researcher wishing to choose one of three 
approaches for analyzing multivariate data 
might desire answers to the following ques- 
tions. How do these procedures compare with 
respect to the magnitude of the two error rates 
associated with each? In what manner are the 
error rates affected by the degree of correlation 
among the variables, number 
involved, and sample size? The purpose of 
the present study is to Provide at least partial 
answers to these questions. "i 

Symbols employed in the Procedures section 
are here defined : 


of variables 


4 — à vector of coefficients of linear com- 
bination (here a has only one non- 
zero clement, which is equal to one) ; 

C = lower triangular transformation ma- 
trix; 

E = sample within-groups sums of Squares 
and cross-products matrix (with 
elements 33") 3 

S = sample within-groups variance-covari- 
ance matrix (with elements $35); 

POR 


population variance-coy 
(with elements Tjj); 

= sample mean vector for the ith sample 

G = 1, 2), with elements Z; where 


ariance matrix 


n 
j= 1to b; 

X — generated observations matrix, with 
clements 


Xj and rows a'i where 


t= 1,2;h = 1 to 50; and j = 1 t0 


Qo 9. 

95 
— vector 
ates 


ia 
l 


of independent, normal devi 


y Bin 


k 
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PROCEDURES 


The three 
E three approaches to the analysis of 
Malle are data which were studied em- 
ally are presented i fi i 
- S n the ; ara- 
graphs: e following para 


arabe ie Univariate analysis of variance. 

emplo ea ire a univariate F test was 

i, B ed to test separately each of the 

NS uis j — dto f, ata — .05. The null 

Eu ens was rejected at the .05 level of 
1cance if 


> Fosh nrn 


ejj nı + Me 


y+ Ny — 2 

Where 7 3 c 
Cotes Pais nı+ng—2 is the upper .05 point on a 
deg ia F distribution for 1 and ni ++ s — 2 

" Ces of freedom. 

4 2j . " r B . 
i ie i 2. Combination of univariate and 
app lvariale analysis of variance. In this second 
sir oach, a test of Mozu = H2 WaS carried out 

ng Hotelling’s 7? test where 


nyt 


Tr. Jae (qai a S 3). 


a m+ ne 
To: 
Hy = py was rejected if 


Te» (nti DPE 
Tmp p1 
“we .-p-1i is the upper .05 point 
= b central / distribution for P and nı + #2 
E 1 degrees of freedom. . 

3 po ve, univariate F tests, as described 
Separa: were carried out on each variable 
x ately. However, a rejection of Hon 
J= L to p, ata= .05 was allowed 
Pes T? value was significant. i . 
ance, bes 3 Multivariate analysis of vari- 
"quatit the third approach, a test of the 
Out ù y of the mean vectors was fi 
tr Using the 7? test of Approach 2 above. 
Statistical value was significant (a = 05), 
tiable g ests were then carried out on each 
vep © Separately. For this approach, how- 
Ing ea " Simultaneous / test was employed 
d of the univariate F test. Thus if 


03; p, nya p71 


Hjo 
only ir 


rst carried 


ER 


a 884 ç Oh 
n =o 
icone — p— do nns 


> F os;pnrinzP -b9 


the Ho:ujı = uj; was rejected. This decision 
rule, which can be used ov/y in the case of two 
groups and where comparisons are to be made 
on one variable at a time, is equivalent to 
rejecting Zo:uj = uj: if 


^ (s =. PETERE 
a’ (# — 33) — (« Sa siw) ME A i 


nə 


< a' (m — we) € a'(& — 3») 


PETRAM 
* (« Sa — —— | Tos p tns 


Hill» 


does not cover zero. This interval provides a 
simultaneous .95 confidence interval for any 
arbitrary linear combination of p variables 
(Roy & Bose, 1953). For the variable-at-a- 
time case, the approach utilizing the simul- 
taneous F test is equivalent to the one sug- 
gested by Morrison (1967, p. 126). i: 

The following paragraphs contain a step-by- 
step description of how the study was carried 
out. These steps actually provide a general 
description of the computer program which 
was written for this study. For the reader who 


is interested, the program can be obtained 


from the authors. 


1. A transformation matrix C was obtained 
transformed 


which, when postmultiplied by 2, 
s into x, a random vector observation from a 
multivariate A (0,2) popula- 
ecified by the investigator. 
Equations for calculating C have been pre- 
sented by Moonan (1957) and Scheuer and 
Stoller (1962). By adding the population mean 
the transformed vectors 
7s + p where v is now 
» = 0, therefore 


p-dimensional, 
tion, where X was sp 


vector u to each of 
Cz one obtains X = 
N(u,z). In this study 41 = # 
a= Cz. 

2. Fifty, nine-element xs were generated, 
and each x’ became à TOW in X. Executing 
this step twice produced Sample 1 and Sample 
2. 

3. The two Xs (each 50 by 9) were stored. 
The elements of E were such that for both 
Xs, all oj; were equal and all Tijn jz* J were 
equal. Data for the following nine experiments 


were obtained from à pair of X matrices: 


an= 10, p= 3 
p.i—10 p= 
c n210, p=9 
d.n= 30, p=3 
e n= 30, p= 6 


f. n= 30, p=9 
g.7-—50 p=3 
h.4250, p=6 
i. n=50, p=9. 


The first three steps simulate the following 
situation. An investigator has two random 
samples of n; = 10 subjects, each of which is 
measured on three variables. He carries out an 
experiment and then measures his subjects 
on three additional variables and carries out 
another experiment with p = 6. He repeats 
the process again, but this third experiment 
now has nine variables. Having finished three 
experiments, 20 additional subjects are ran- 
domly obtained for each group so that 2; = 30. 
In a manner identical to the case where 1;— 10, 
three more experiments are carried out where 
alternatively p = 3, 6, and 9. Six experiments 
have now been completed. Sample size is 
increased to n; = 50, Again, three more experi- 
ments are performed where P = 3, 6, and 9. 

Nine experiments were performed, each 
under a different combination of n; and $- 
Each experiment was analyzed by the three 
approaches described previously, and the 
errors made by each procedure were recorded. 
Since some of the data in the X matrices were 
used in more than one experiment, the com- 
plete independence of these experiments 
cannot be assumed. This overlap was neces- 
sary due to cost considerations, 

Only rejections of Ho:uji = uj were counted. 
The overall test of mean vectors did not enter 
into the numerical calculation of error rates; 
rejection of Hom = po, however, was a 
Prerequisite for counting rejections of 
= Hy in the second and third 
This was done so that 
decisions of the same ty 
each approach, 
of little theoreti 
only that two 
Often one will 


Ho: By 
approaches, 
an equal number of 
pe would be made by 
(Additionally, it is frequently 
cal or practical value to know 
mean vectors are unequal. 


Want to know on which vari- 
able[s] differences exist.) 


4. Steps 2 and 3 were re 
5. Four sets of 
Steps 1-4 were 
ferent random r 


peated 1,000 times, 
experiments as described in 
analyzed, For each set, a dif- 
number was used to initialize 
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the random number generator? and mos! 
important, each run was made with $ dif- 
ferent population covariance matrix. X was 
varied so that c;;?/a;;sp; = pjj?, the propor- 
tion of variance in common between the jth 
and j'th variable, was equal to .10 in the d. 
set, .30 in the second, .50 in the third, and .7 

in the fourth. 


In summary, there were 36 different com- 
binations of levels of ;1;, p, and proportion of 
variance in common (3-3-4 = 36) used in 
this investigation, and 1,000 experiments 
were carried out under each combination 
The error rates presented in this study are 
based on the number of errors which were made 
when each of the 36,000 experiments was 
analyzed by the three different approaches: | 
(These error rates are, of course, point esti- 
mates of theoretical probabilities; however, | 
in the following sections modifiers such as 
estimated and obtained are not used.) 


FINDINGS AND INTERPRETATIONS 


Table 1 contains the error rates per com- 
parison and the experimentwise 
which were obtained by 
multivariate data with univariate analyses of 
variance (Approach 1). The error rates per 
comparison range from .040 to 055, and no | 
consistent trend seems to exist here. Theo- 
retically, one would expect these values to be 
close to .050, and this seems to be true. 4 

In the lower half of Table 1 are the experi- 
mentwise error rates, Theoretically, in the case 
of py? = 0, that is, no variance in common; 
one should observe rates of 1 — (1— a)”, 
and for the opposite extreme of complete | 
dependence, Pij? = 1.0, the theoretical value 
would be a. In this study, four points have 
been estimated between -00 and 1.00 propor: 
tion of variance in common. For purposes o 
comparison, the experimentwise error rates 0 
Table 1 have been graphed in Figures 1, 21 
and 3. It can easily be seen in Figure 1 that th^ | 
€xperimentwise error rates 


error rates. 
analyzing samples of | 


increase as 
*The multivariate data generator used ANORM: 

an entry point in RANDNR, to generate the element 

of z. This subroutine is in the Ohio University Comput? 


Center library. It Was programmed by Larry Irwil 
Department of Mathematics, Ohio University. 


" 


T. WE 


f 


a decrease as proportion of vari- 
2 M ape (0;j?) increases. In Figures 
respectivel Wee on m;— 30 and 2, = 30, 
less of a: 5 ic same trends appear. Regard- 
Tid for p E there is always less fluctua- 
B enc " x he largest fluctuation due to 
Ev Ee occurs with p = 9 and 
drops er er that circumstance the rate 
S inc 34e to .129 as proportion of vari- 
Table pomon increases from .10 to 10. 

Den a secre the error rates obtained 
varinte ed bination of multivariate and uni- 
ployed E RUM (Approach. 2) was em- 
per en; ege the data. The error rates 
Tange eq are fairly homogeneous and 
always se bos to 022. These values are 
«rer I than: their counterparts in Table 
for tice S consistent but very slight trend 
"luetuati p. to decrease as P increases. 
ini are generally less than .010 and, 

, would seem to be of little practical 


En, TABLE 1 
"RROR RATES 5 
p Apes PER COMPARISON AND EXPERI 
` ve 
s RATES FOR ANALYZING MULTIV 
JATA WITH UNIVARIATE ANAL 
OF VARIANCE 


—— 
Sample ae ees Proportion of variance in common 
variables - xci 
E 40 .30 50 «70 
EL. No | 
Error rate per comparison 

10 * 
10 3 043 | .053 | .040 
10 6 44 | .055 | .040 
30 9 6 | .055 | .043 
30 3 051 | .050 | -043 
30 6 1052 | .047 | -040 
50 9 ‘953 | .046 | -040 
50 3 ‘950 | .043 | 08 
50 6 ‘ous | 045 | 0H 

E 2 49 | .037 | -047 

E- Experimentwise 
ij [| 
10 3 44s | 312 | 114 077 
10 6 26; | 190 | 78 | 41 
30 9 ag | 247 | 209 | -129 
30 3 "us | an | au | -085 
30 6 ‘225 | 200 | 476 | 15 
B ? 206 | 263 | .223 | -140 
20 3 ass | 3124 | .102 | -083 

m 6 230 | 190 | .160 | -115 

Eel 324 | 258 | .208 | .146 
Jl. ms 
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EXPERIMENTWISE ERROR RATE 
5 S 


10 30 .50 70 


PROPORTION OF VARIANCE IN COMMON 


Fic. 1. Relationship between experimentwise error 
rate and proportion of variance in common for uni- 


variate analyses of variance (n; = 10). 


importance. Likewise, the experimentwise 
error rates are reasonably homogeneous. No 
consistent trends appear in the data. This is 
in sharp contrast to Table 1. The values in 
Table 2 range from 017 to .050, As in the 
case of the error rates per comparison, these 
values are always lower than their counter- 


parts of Table 1. 
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EXPERIMENTWISE ERROR RATE 


Ale) 30 50 70 
PROPORTION OF VARIANCE IN COMMON 


Fic. 3. Relationship between experimentwise error 
rate and proportion of variance in common for uni- 
variate analyses of variance (n; = 50). 


Table 3, for the most part, is homogenous 
with respect to error rates. The completely 
multivariate approach (Approach 3) allows 
for few errors. A slight but fairly consistent 
trend is for this approach to become more 
stringent as increases. For the case of p = 9, 
12,000 experiments were carried out. Only 1 
of these experiments had an error in it. There 
are 72 error rates presented in Table 3. In 
no instances did these error rates exceed the 
Corresponding rates in Table 2, 

Table 4 was not a part of the computer out- 
put, and originally no plans were made to 
collect such dat: 


a. However, inspection of the 
data suggested that it would be interesting 


to calculate the average number of errors 
made in experiments with at least one error, 
Only the univariate approach (Approach 1) 
and combination approach (Approach 2) 
are presented; the strictly multivariate ap- 
Proach (Approach 3) allows for so few errors 
that it was not included. With few exceptions, 
the average number of errors in experiments 


with at least o; i 
S ne error increases as a functi T 
of both p eA 


and proportion of variance i 
a n com- 
E Sor is univariate case, the highest 
à "rd in was obtained under n;= 10, 
T Les ees of variance equal to 
10. e combination . a -d 
inus. xj approach, under 
: | UPC Proportion of variance in 
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common equal to .70, the highest averag 
number of errors was reached, 3.950. Th 
combination multivariate and univariate | 
proach consistently produced higher wm. 
than the univariate approach. In genera 
the discrepancy increased as p increased. 

"Table 5 presents the error rates for | 
overall 7? test. While these rates by themselv@ 
were not of primary interest in the study, the) 
were computed and are here presented ^ 
further evidence of the validity of Hotelling 
T? test. 


Discussion AND RECOMMENDATIONS 


The third approach, overall multivariat! 
analysis of variance followed by simultancol! 
F tests, is discussed first and attention is the” 
turned to the other approaches. As the num 
ber of variables increases, this completely 
multivariate approach becomes very con 
servative and would seem to be useful onl) 


TABLE 2 
Error Rates PER COMPARISON AND Ext 
Error RATE 
Data WITH MULTIVARIATE 6 
VARIANCE FOLLOWED BY UNIVARIATE 
ANALYSES OF VARIANCE 


;RIMENTWIS 


Proportion of variance in common 
Sample | Number of |. 

size | variables E 

10 30 .50 gn 

Error rate per comparison - 
10 3 018 | .012 | o1g | .012 
10 6 010 | .009 | .012 | .007 
10 9 .009 | .007 | .008 | .005 
30 3 017 | .022 | .014 | 014 
30 6 011 | 013 | .o11 | .009 
30 9 010 | .012 | .009 | .009 
80 3 018 | 021 | 016 | 017 
50 6 011 | .013 | i010 | 01 
50 9 008 | 011 | oos | .00? 
| an 
Experimentwise 

10 3 044 | 029 | oss | O7 
10 6 46 | 029 | ‘030 | -015 
10 9 050 | 026 | s | 07 
30 3 037 | i | ‘029 | 0% 
30 6 037 | 037 | 032 | -0% 
30 9 042 | 042 | .030 0 
50 3 038 | 041 | .033 ot 
50 6 937 | 039 | .028 | OG 

50 9 036 | 1038 | .026 | ^ 
ee diee [077 77 e 


| 
|. 
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when the cost of a Type I error is extremely 
high. The ability of this approach to find 
differences when they actually exist would be 
questionable. If used in practice, the following 
anomalous situation would frequently arise. 
The researcher would reject Mo: = jis and 
infer that Mo:mı ¥ us. However, the applica- 
tion of the simultaneous F^ tests would result 
in none of the HMoiuj = uj J = 1 to p, being 
rejected. The researcher would thereby con- 
clude that the vectors were unequal but that 
the groups did not differ on any of the vari- 
ables, (It should be mentioned that this 
Situation can also arise with the combination 
approach, but not as frequently.) 

Assuming that a researcher does not wish 
to use an extremely conservative technique, 
he still has two approaches from which to 
choose: (a) univariate analyses of variance 
and (b) multivariate analysis of variance 
followed by univariate analyses of variance, 


TABLE 3 


Error RATES PER COMPARISON 
Error RATES FOR ANALYZING MULTIVARIATE 
DATA WITH MULTIVARIATE ANALYSES oF 
VARIANCE FOLLOWED BY 
SIMULTANEOUS 7 TESTS 


AND EXPERIMENTWISE 


Sample Number ai Proportion of variance in common 
^ variables : 
A0 30 50 40 
Error rate per comparison 

SS 
10 3 | .oo3 | .002 | -005 .003 
M 6 ‘oo | .o00 | .000 | -000 
a 9 ‘ooo | .000 | .000 .000 
30 3 ‘oo6 | .003 | -002 004 
30 6 ‘ooo | .001 | .000 .000 
29 9 ‘yoo | .000 | .000 .000 
a 3 ‘007 | .004 | -004 .003 
2 6 ‘ooo | .000 | -000 .000 
ath 9 ‘ooo | ,000 | 000 | -000 

I Experimentwise 
n 3 010 | 005 | .012 .006 
10 6 ‘ooo | .000 | -003 001 
30 9 ‘oor | .000 | .000 .000 
30 3 ‘or7 | .010 | .004 .010 
30 6 ‘ooo | .003 | 001 | 201 
50 9 ‘ooo | ooo | .000 | .000 
50 3 022 | o11 | .010 | -008 
50 6 002 | oot | 001 | -002 

i 9 .000 | .000 | -000 .000 


on 
or 


TABLE 4 


AVERAGE NuMBER OF ERRORS IN THOSE EXPERIMENTS 
HAVING at Least ONE ERROR 


Sample | Number of Proportion of variance in common 
size variables 
10 .30 .50 Ej 
Univariate analyses of variance 
10 3 1.110 | 1.152 | 1.404 | 1.545 
10 6 1.206 | 1.400 | 1.843 | 2.180 
10 9 1.376 | 1.676 | 2.359 | 2.984 
30 3 1.148 | 1.286 | 1.282 | 1.518 
30 6 1.253 | 1.570 | 1.591 | 2.087 
30 9 1.385 | 1.825 | 1.870 | 2.593 
50 3 1.123 | 1.218 | 1.275 | 1.566 
50 6 1.274 | 1.505 | 1.681 | 2.304 
50 9 1.404 | 1.702 | 2.029 | 2.884 
Combination of multivariate and 
univariate analyses of variance 
10 3 1.205 | 1241 | 1.543 | 1.591 
10 6 1.326 | 1.828 | 2.433 | 2.529 
10 9 1.600 | 2.385 | 2.840 2.722 
30 3 1405 | 1.585 | 1.414 1.640 
30 6 1.757 | 2.135 | 2.063 | 2.571 
30 9 2.143 | 2.571 | 2.633 | 3.714 
50 5 1.895 | 1.537 1485 | 1.857 
50 6 1.730 | 2.077 | 2.071 2444 
50 9 2.000 | 2.526 2.846 | 3.950 
the combination approach. It might be tempt- 
ing to state simply that the combination 
he uni- 


s more conservative than t 
and that a choice between 
ased on the degree of 
avenue is not taken 


approach i 
variate approach 
the two should be b 
conservatism desired. This 
TABLE 5 

yOR THE MULTIVARIATE 


Error RATES 
p VARIANCE 


ANALYSES © 


Proportion of variance in common 
Sample Number of 
size variables 40 E E P 
ne e en 

10 3 44 | 040 | -048 045 
10 6 “065 | .050 | -053 047 
10 9 i069 | .046 | -046 .045 
30 3 ‘os | .056 | .047 043 
30 6 i46 | 051 | .049 053 
30 9 48 | 055 | OH 051 
50 3 ja; | 053 | -049 047 
50 6 i3 | 058 | 058 .053 
a0 9 “ou | o6 | 046 | 05 
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here for two reasons: (a) the combination 
approach is not extremely conservative; and 
(b) there is an important difference between 
the univariate and the combination approaches 
which can serve as a basis for choice. 

With the univariate approach one typically 
does not know the experimentwise error rate, 
whereas with the combination approach one 
can effectively estimate and control its value. 
There are only two instances when the experi- 
mentwise error rate is known for the univariate 
analysis of multivariate data, and that is when 
the comparisons are either completely de- 
pendent or completely independent (p; .00 
or pj? = .00, respectively). When univariate 
analyses are employed and the variables are 
uncorrelated (given that the null hypothesis 
is always true), the experimentwise error 
rate is obtainable from 1 — (1 — a)». When 
nine independent comparisons are carried out 
in an experiment the error rate is .37. If the 
experiment were replicated 1,000 times, 370 
experiments would be expected to contain 
at least one error. In addition, if a = .05 for 
each of the 9,000 comparisons, 450 of them 
would be expected to be significant due to 
Chance. On the average, there would be 450/ 
370 — 1.216 significant comparisons in those 
experiments with at least one error. Obtaining 
three or more significant comparisons due to 
chance alone would be a rare event (.008). 

However, this is not the case when the tests 


y among 
If complete dependency 

àn experiment are 
ficant., In practice, 
and approximately .70 
three errors would fre- 
aining 
ance, seeing 
would not se ud nt 
fort. This sit 


increases, and with it 
errors, experimenty 
Even so, when the pr 
common is .70 and p = 9 
in Table 1), the expe 


rate decreases, 
of variance in 
(collapsing across 12; 


"™Mentwise error rate 
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was close to .14. An average of almost three 
errors was made in approximately 14% of 
the experiments. In analyzing multivariate 
data with the univariate approach, a rather 
confusing state of affairs exists. The probability 
of at least one error fluctuates directly with ? 
and indirectly with the proportion of variance 
in common. Experimentwise error rate seems 
especially important in this case because the 
errors occur in sets. The average number of 
errors also fluctuates with p and proportion 
of variance in common. One may well wonder 
whether he should be impressed when 4 tests 
are significant out of 11, especially if his 
univariate F test computer program does not 
provide a correlation matrix. The discussions 
of error rates have a certain neatness Whenever 
independence is assumed, but introducing 
dependence makes the situation rather chaotic. 

Compare with the above a situation in 
which experimentwise error rates remain 
relatively constant despite fluctations in p 


and proportion of variance in common. The 


combination approach oflers such consistency. 
The errors still occur in sets, and slightly 
larger sets than in the univariate approach 
at that, but the probability of a set of errors 
occurring is appreciably less for this approach, 
and most important, the probability can be 
accurately estimated. For the situation de- 
Scribed above where p=9 


and proportion 
of v 


ariance in common was -70, the average 
number of errors in experiments with at least 
one error was approximately 3.5, but the 
experimentwise error rate was .02 rather than 
-14 (again collapsing across ni). 

The above discussion leads to the following 
recommendations: 


1. The use of the univariate 


approach with 
multivariate 


data should be discouraged: 
particularly as p and proportion of variance 
in common increase, The grouping of error? 
and the generally u 
error rates can easily 
tions of findings. 

2. The combination approach results in a? 
experi imentwise error rate which is reasonably. 
Consistent, regardless of P and proportion 9 
variance in common, This procedure doe 
not eliminate the occurrence of a group [ 
errors, but it keeps the probability of a grou? 
occurring at a low level. This approach ! 


allow for misinterpret® 


nknown experimentwist | 


TH 
: 
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recommended for its consistency and because 
Its conservatism is not extreme. 

. 3. The completely multivariate approach 
is recommended only for those who desire 
an extremely conservative test. In the form 
used in this study, this approach seemed to 
Control for grouping of errors as a function of p, 
but not as a function of proportion of variance 
in common. This last point is stated cautiously 
Since the extreme conservativeness of this test 
made it difficult to observe trends. 
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Recent studies with deaf subjects are reviewed, 
learning, logical symbols, Piaget- 


emergence of cognitive 


an unfavorable environme: 


. The reviewed data challenge 
ual development largely upon lin: 


rm Piaget’s operative theory. 


Since Publication of the 
Bulletin article 


new studies 2 


1964 Psychological 
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der, Starting 
have been re- 
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“thinking.” As an 

iz Rule Learning 


with deaf children 1. Blank and B 
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2. A veteran of German Denkpsychologie 

veigl, investigated another type of rule 

learning in the absence of verbal encoding, A 
g. 


series of 11 tasks was administered by means 


TABLE 1 


Task and age 


Study: author and number 


Topic 


Similar 


Performance of deaf in 
comparison to hearing 
subjects 


Inferior or 
different 


Rule learning 


15-19 


1. Blank & Bridger, 1966 
| Weigl & Metze, 1968 
. Pufall & Furth, 1966 

. Youniss & Furth, 1966b 
. Andre, 1969 

. O'Connor & Hermelin, 
1965 

7. Furth, 1964a 

8. Furth & Pufall, 1966 
9, Goetzinger et al., 1967 
0. 

1 


ESI 


an 


. Odom & Blanton, 1967 
. Michael & Kates, 1965 


12. Furth, 1966a 
13. Furth & Youniss, 1965 
14. Youniss & Furth, 1967 


15. Youniss & Furth, 1965 

16. Youniss & Furth, 1966a 

17. Piaget, 1966 

18. Furth, 1966a 

19. Youniss, 1967 

20. Ross, 1966 

21. Robertson & Youniss, 
1969 

22. Youniss & Robertson, 
1970 

23. Furth & Youniss, 1969 


24. Lantz & Lenneberg, 1966 
25. Youniss & Furth, 1966b 
26. Withrow, 1968 


27. Rozanova, 1966 

28. Ross, 1969 

29. Goetzinger & Huber, 1964 
30. Conrad & Rush, 1965 

31. Conrad, 1970 

32. Oléron & Gumusyan: 1964 
33. Yashkova, 1966 

34. Gozova, 1966 
35. Suchman, 1966 
36. Carrier, 1961 

37. Costa et al., 64 
38. Stoy va, 1965 

30. Furth, 1961 


Cross-modal transfer 
Rule learning 
Double alternation 
Discontiguity 
Reversal shift 
Visual rules 


Combinatorial sequences 
Combinatorial sequences 
Raven’s matrices 
Sequential rules 

Social concepts 


Logical symbol use 
Symbol (a) use (b) discovery 
Symbol use 


Transitivity 
Transitivity 
Conservation tasks 
Quantity of liquid 
Seriation 
Probability 
Anticipatory images 


Anticipatory images 
Formal operational tasks 


Color recognition 
Recognition strategies 


cessive 
Pictures 
Symbol span 
Design reproduction 
Type of encoding 
Type of encoding 


Embedded figures 

Figure reversal 

(a) recognition (b) drawings 
Preference (a) color, (b) form 
Color-weight association 

RT (a) visual (b) bimodal 
EMG and REM in sleep 
Size-weight illusion 


Sequence (a) simultaneous (b) suc- 


x 
X (a) 


X (15, 16) 
X 
x 


X 9-11) 
X (a) 
X (a) 
x 
X (a) 
x 
x 


X (8,6 


MA 


(2 tasks) 
(11, 13) 


X (9) 


X (6) 
X 
X (b) 


X (9, 10) 


Fa 


of gestures; the tasks required successive pair- 
wise selection of extreme values on a given 
dimension (largest, smallest; heaviest, light- 
est; etc.), including shifting to a different di- 
mension on identical objects and adding or 
subtracting of elements in a given tower to 
make it in one case the highest or in another 
the lowest. As an example of optimal perform- 
ance in children without knowledge of a so- 
cial language, Weigl and Metze (1968) re- 
ported that 16 of 34 profoundly deaf children, 
ages 4 to 6}, succeeded on all given tasks by 
readily transferring the principle of extreme 
values to new problems. The authors consid- 
ered these results to demonstrate the possibil- 
ity of nonverbal logical-mathematical thinking 
with implications reaching beyond psychology 
into epistemological, linguistic, and educa- 
tional theory. 

3. Pufall and Furth (1966) observed yet 
another rule behavior which is traditionally 
supposed to depend on verbal cues. They ad- 
ministered a series of double alternation tasks 
that tested for generalization and stability of 
the acquired principle. Subjects were deaf 
fand hearing children in an age range of 4 to 
9 years (N — 12 at each age). At age 4 and 
again at ages 7 and 9 there were no differences 
between the groups; however, hearing children 
were somewhat better than deaf children at 
age 5 on acquisition and at age 6 on transfer, 

4. Two experiments were reported by You- 
niss and Furth (1966b, Pp. 9-13) in which 8- 
year-old deaf children were found to perform 


€ discrimination learn- 
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children, both hearing and deaf, familiarit 
of stimuli made reversal more difficult; (d) 
hearing and deaf children performed alike. 
These three studies, taken together, sugges 
that the processes which bridge spatial and 
temporal discontiguity are effectively similat 
in linguistically deficient and linguistically 
competent children. 

5. In order to test the hypothesis that 


verbal mediation may facilitate reversal, An- 


dre (1969) observed 7- and 12-year-old heat 
ing and deaf children (N = 96). The results: 
showed, as expected, that given an option 3 
greater proportion (about 75%) of older than 
younger children (about 54%) chose reversal; 
but the main hypothesis was contradicted by 
two facts. The linguistically deficient deaf 
children performed consistently like the hear- 
ing controls, and reversal performance oc 
curred as frequently with those hearing sub- 
jects who were able to verbalize correctly as 
with those who verbalized incorrectly. 

6. O'Connor and Hermelin (1965) devised 
a series of tasks that were intended to be 


visual-spatial analogies of operations required \ 


in linguistic competence, These tasks were 
ordering or seriation, rule discovery involving 
both sight and touch, immediate m 
perceptual matching, 
imbecile experimental 
and normal control 


emory, and 
Besides psychotic and. 
groups, aphasic, Edi 
groups, ages 10 to 11 


manipulate signs and symbols is not 
dent on speech or verbal comprehensi 
that consequently the logic of thinki 
competence in verbal expressions deve 
atively independently, 

7. Two studies focused directly on differen- 
tiation of combinatorial sequences as a crucia! 
prerequisite for successful learning; for in- 


stance, the subject had to learn four different 
responses to four thr 


ABB, AAB, BAA, BBA. Tt w 


derlies linguistic 
three groups 0 
hearing, deaf, and 
The first study 


depen- 
on and 
ng and 
lop rel 


as hypothesize! 


mean age of 10, and perform- 


I 
i 
d 
d 
f 
1 


= 12 in each cell), were employed. These j 
three control groups succeeded equally well on 
all tasks. It was concluded that capacity to 


y 


= 10 in each group: , 
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M E omnes visual paired-associ- 
iene E e sequence task was made up of 
D. sti p forms ABCD which generated 
ince: mulus pairs AC, BD, AD, BC. Re- 
Cards n consisted of pointing to one of four 
C bes 1 to A. A maximum of 100 
success Lge was given with a criterion of 
BE eti nine correct responses within 10 
tation i" trials. With two modes of presen- 
tion in i results suggested a slight. retarda- 
oth in ie aphasic but not in the deaf group, 
s. simultaneous and successive presenta- 

$ M e secon study (Furth & Pufall, 1966) 
! aphasic ee the suggested inferiority of 
Were seb edt Children at two age levels 
group i» ied, 6 and 10, with N = 16 in each 
impaire 3 the visual task the younger hearing- 
Perform A pin deaf and aphasic) children 
mma somewhat below the controls on 
alike: ive sequences, but the older performed 

| etene iy the simultaneous condition, no dif- 
auditory were noted. For hearing children, 
| tently i4 Successive presentation was consis- 
Á asier than visual presentation. The in- 


e 

eae of the hearing-impaired groups 
E to general lack of linguistic ex- 
o traing insofar as natural speech can provide 
f & ng in the ability to master sequentially 


1 Presented combinations. 
Bice ational data on deaf children’s per- 
Matrices on Raven's Standard Progress? 
and D, es were provided by Goetzinger, Wills, 
ekker (1967). They reported a mean of 
lage a significant increase to 36.0 at 
l far after 3 months) at a mean age o 
AA = 96), an apparent lag of 13 to 2 
6 ompared to the published norms. When 
ps children, around 183 years of age, 
"irst ested, two interesting results emerged. 
Wi des Raven's score Was TOW 45.1, well 
e normal limits, and second, the reading 
9f the older groups indicated less than 6 


: Mont 
hs advance over the younge 
lts that the 


E 

DER 4 
slight inferred from these rest 
vement could hardly 


Xplaj 
a] 
i the poor 
up could 


r group. The 


linguistic impro 
b Dian. improved Raven's score; 
i e e of the younger age gro 
| hee, be attributed to lack of general experi- 


10. Odom and Blanton (1967) hypothesized 
that deaf children may have difficulty remem- 
bering the temporal order of unfamiliar 
material because sequential information is not 
a relevant dimension of their linguistic ex- 
perience. Groups of deaf children about 17 
years old and two hearing control groups (one 
equated on age, the other of age 11 equated 
on reading achievement) were shown 20 pairs 
of letters or of nonsense forms, in successive 
(SUC) or simultaneous (SIM) conditions. 
The sequential order of the two elements 
within these pairs followed the rule that five 
elements were always in the first place, while 
five others were always in the second place. 
Subsequently à recognition task was given 
which included new combinations of the fa- 
s. Some of the new combinations 


miliar element 
ove rule, others were 


were according to the ab 
reversed. It was expected that deaf persons 
would be less prone to make recognition er- 
rors due to sequential rule learning (sys- 
tematic errors) than errors of sequential re- 
versal (reversal errors). In terms of overall 
results their performance fell between the two 
control groups: They correctly identified 14.9 
pairs (versus 15.7 and 14.1 for the two hear- 
ing groups) and made 3.1 systematic errors 
(versus 3.4 and 3.3) and 1.7 reversal errors 
and 2.0). Compared to their age 
controls, the deaf persons made more reversal 
errors particularly on SUC with nonsense 
forms. In sum, linguistically deficient deaf 
persons formulated implicit rules pertaining to 
sequential order in a manner similar to hear- 


ing persons but not to the same extent. 
11. Michael and Kates (1965), with a se- 
f 20 deaf seniors from the 
and two hearing controls, 
equated by age and scholastic standing, found 
otable differences between 
y task which re- 
of familiar social 


(versus 1.1 


ying social situations with 
ge, or clothing. The task 
er the pictures that were correct 

i “focus” picture. It was 


pictu 
different affe 
was to discov! 


instances of a given ; i | 
concluded that the deaf subjects 1n this sam- 
cial attitude as 


ple manifested as mature à SO 


the controls. 
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Logical Symbols 


12. Chapter 14 of Furth (1966a, pp. 215- 
222) described a teaching demonstration of 
logical thinking, based on symbolized con- 
cepts of classes and logical connectives 
matched against pictured instances. Three 
groups of 16-, 13-, and 9-year-old deaf chil- 
dren participated. After 4 days of teaching, 
even the youngest deaf children succeeded in 
Some symbolic conceptual situations. A fol- 
low-up Study indicated that the performance 
of the 10 deaf children in the youngest age 
group was comparable to a class of 11 hearing 
peers, The following are the results in terms 
of mean percentage of children who succeeded 
on all three items of a given logical operation 
with hearing scores Preceding those for deaf 
children on each type of operation: negation 
(73, 78); conjunction (78, 82); affirmation- 
simplification (82, 56); disjunction-simplin- 
cation (55, 67); disjunction-exclusion (36, 
33; disjunction-addition (27, 22); conjunc- 
tion 1-negation (73, 44); conjunction 2-nega- 
tions (45, 67); negated conjunction (18, 33); 
negated disjunction (27, 22). 

13. The above method was derived from 
an experimental procedure first devised by 
Furth and Youniss (1965) to test logical 
thinking by means of a series of symbol cards 
(containing ymbols for affirmation, 

; and exclusive disjunc- 
nstance cards illustrat- 


E. k n symbol use were 
ound to discovery task that 


or 13- to 14-year- 


old hearing youngsters, as reported in great 
detail by Youniss and Furth (1964). 

As the acquisition task on symbol discovery 
did not by itself require learning of the me 
ing of the symbols, success on symbol da 
covery could only be achieved if subjects 
took the initiative in discovering the mead 
ing of the logical symbols. It is in the n 
covery aspect of thinking that the E 
subjects apparently failed. Following ud 
Suggestion that experiential deficiency may 
result in lack of intelligent initiative or mo 
tivation, the authors sampled a rural popula 
tion of a lower socioeconomic strata a 
observed that the performance of the rural 
Sample was similar to that of the deaf san 
ple: They were equal to controls on symbo 
use, but failed on symbol 
results seem to confirm the proposition that 
a deaf person's failure on nonverbal thinking 
tasks is less related to linguistic deficiency a$ 
such than to lack of a stimulating social and 
experiential environment, 
14. Continuing in the line of the 
vious investigations, 
(1967) trained 24 
different (America 


two pre- 
Youniss and Furth 
deaf, 24 hearing culturally 
n Indians), and 24 control, 
young adolescents jn five sessions of logic 
learning, After training 

their ability to 


identify newly introduced 
drawings as true 


or false instances of sym- 


bolic Concept statements. On the more com- 
plex concepts, de; 


inferior to the c 


Piaget-Type Tasks 


15. Youniss and Furth (1965) investigated 
by nonverbal means the manifestation of the 
ogical principle of transitivity (A> By B 
C; therefore A > C). Employing an incline 
plane, they first taught the child that a ball 
of Color A would Push down a ball of Colo 
B, but not vice versa, and similarly that Bal 
B pushed down a third Ball C. Subsequently: 


| 
discovery. 1 


they were tested on , 


| 


a 
elie. E uis cu of deat and hearing 
‘would push di Rn ME predicted that Ball A 
dren at oe own Ball C, while hearing chil- 
any partic ane to 6 showed no preference for 
| Was inter did pression. This performance 
| ous Ee ^ 2 as indicative of the spontane- 
children in rder which older, but not younger; 
i A apose on a neutral event. 
1966) E wien study (Youniss & Furth, 
| ice the former issue further by 
mension ns E as an added irrelevant di- 
* smaller | p in the incongruent condition 
oth cases po. would push a bigger ball. In 
the critical pii ct the color of the ball was 
in De ariable. With hearing children 
ound that na Grade hs and Grade 3, it was 
Y the Mage ei predictions were made 
DS] con aru r children irrespective of percep- 
Performed IGHcys but the youngest age group 
When D toa logical pattern only 
Deaf n is congruent with logical grouping. 
hb. we comparable in age to children 
S far as lo a miade in general more errors, but 
Derformeq = grouping was concerned, they 
and the f ike the hearing children. Both this 
j Bipporting pine study were interpreted as 
a concret Piaget’s theory of development 
nd penden operational thinking as largely 
7. Di nt of linguistic competence. 
à pilot "e (1966) reported the results of 
F, to opmental study carried out by 
Various ter. Twelve to 26 deaf children at 
tas ah ies levels were employed for each 
of Üns tested with a clay ball for judgments 
ol j eon Similar to results with con- 
ai ve qui order of attainment oc- 
» Years) deaf children regarding substance 
(IS), irren (9 years), and volume (11 
gj ation ese results indicated a slight re- 
ive compared to a hearing control group 
fon, 2 procedure; however, in con- 
a great surface area and volume there 
ora, Irface), difference (103 versus 8) years 
, while conservation of quantity 
Aude Yt Pani tested a substantial retardation 
ed in th Sears (113 years). Piaget con- 
ese words: 


i ar iun manifest a retardation in opera- 
vit lation ^. pp. notable and the differences 
[o are due rather to the procedures 
problem is communicated. While there 
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may exist a certain a i 

to linguistic m Nope: Bue 
to consist more in stimulating one EAE 
cid and in facilitating social mobility rather 
[ies pue about the structures of operations 

18. Similar results and conclusions were ob- 
served by Furth (19662, pp. 121-124; 155- 
158) on judgment of conservation of amount 
of liquid with deaf, control, and culturally 
different children. The results indicated a 
considerable time lag between the hearing and 
the deaf, so much in fact, that performance 
could not be taken as a valid manifestation 
of intellectual level in the deaf group. These 
were the results: On a logical operation which 
Piaget and others report as mastered by the 
age of 7, deaf children did not generally suc- 
ceed until age 16, children in a rural area 
until age 11, and hearing controls until age 
9, Furth suggested that these discrepancies 
were due primarily to differences in experi- 
ence and test attitudes that influence per- 
formance on Piaget’s tasks. 

19. A series of investigations was reported 
in Youniss and Furth (1966b, pp. 32-40) and 
in Youniss (1967) concerning spontaneous 
logical rules imposed by the child in handling 
concrete material. The tasks were seriation, 
correspondence, reverse correspondence, sort- 
resorting, multiple seriation, and 
multiple classification. In a comparison of the 
performances of deaf children, ages 6 to 9, 
with hearing controls a considerable lag in 
spontaneous logical behavior was observed. 
With additional task information and demon- 
stration, the logical performance of deaf chil- 
dren approximated the level of hearing chil- 
dren. On the basis of these and other studies, 
a discrepancy between intellectual capability 
and performance was hypothesized; in other 
words, for some reason deaf children did not 
readily utilize logical operations that were 


otherwise shown to be available. 
mpared hearing (ages 


ing and 


20. Ross (1966) col 
7 to 15) and deaf children (ages 11 to 15) 
in which children were 


ability task 
guess the outcome of objectively 
! ounger deaf children 


While y 
be slightly poorer, the older 
deficient, 


still linguistically 


on a prob 
required to 
varying odds. 
were found to 


gh 
ones, thougl 
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caught up with the hearing children. Ross 
Suggested that “direct exposure to and com- 
merce with probability situations affords suf- 
ficient opportunity for the attainment of 
Probability competence even in the absence 
of concomitant improvement in linguistic 
skills [p. 925].” 
21. Two studies explored the ability to 
visualize transformations of physical situ 
in deaf and hearing chi 


Youniss (1969) in a developmental study 


utilized some of the more recent techniques 


to observe what Piaget calls “mental imag- 
ery.” The authors 


examined children’s knowl- 
edge of horizontality of water level and pro- 
jection of shadows at two age levels (8 to 9 
and 11 to 12), The tasks included anticipa- 
tion, demonstration, retest, and generaliza- 
tion, in this order. The 16 deaf children at 
each age level were similar to their controls, 


with younger performing consistently poorer 
than older children, 


Only on anticipation of 
shadows did the younger deaf children show 
a slight retardation, 


22. A second study (Youniss & Robertson, 
1970) employed two imagery tasks in which 
each physical transformation led to a unique 
result that could not be subsumed under a 
single linear function as in the former inves- 
tigation. In the perspective task the child 
had to imagine what a given scene would look 
like from eight distinct locations (from 45° 
to 180°). In the rotation task a Square was 
moved around a fixed Square of a different 
color; the child had to imagine the location 
of the square as it rotated from 45° to 325°, 
Responses were made by selection of four 
alternatives. The three test phases w 
ticipation, demonstration with correcti 
retest. Two deaf samples at mean ag 
and 11-11 wer 
ing samples (N 
indicated age- 
deaf children 
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ations 
ldren. Robertson and 


ere an- 
on, and 
es 8-10 
e matched against two hear- 
= 24 in each group). Results 
related improvements. The older 
were comparable to their hear- 
ing peers, and the younger deaf children 
were similar on rotation but markedly in- 
ferior on perspective, particularly on the re- 


test. Taken as a Whole, this and the above 
study confirm Piaget’s theory that visual 
forms of logical 


thinking do not require in- 
ternalized linguistic Symbols but depend pri- 


marily on the development of operation 
thinking structures, } 
23. A pilot study on formal operatio 
thinking in deaf adolescents was presente 
by Furth and Youniss (1969). This is a E 
ficult question to probe decisively by no 
verbal means since the presence of formi 
operational thinking is primarily int 
from verbally formulated reasoning ra 1 
than from success observed in external pe" 
formance. Seven deaf adolescent boys, po? 
in written and oral English, were selected ? 
the basis of above average intelligence 1 
measured IQ. They were observed on a mogi 
fied nonverbal version of six Piaget M s 
combination, probability, volume, space, aol 
cline, flexibility. No one succeeded or ws 
clearly on all tasks, but five boys succee 
On at least one. The results were discuss? 
in relation to Piaget's theory of formal opea 
tional thinking. Some Support was seen 7” 
the proposition that formal operations | A 
velop, at least in their primary formato 
without the support of linguistic competence. 


Memory 


24. The division of the color continuur í 
commonly recognized to be a more o Al 
arbitrary affair, determined largely by soc", 
transmission, It is therefore of particular a 
terest to observe whether linguistically E. 
ficient deaf children differ from controls E 
a task that requires discrimination of a Pa 
ticular color. Lantz and Lenneberg (1966 

carried out such an investigation, using dole 
and hearing subjects at age 6 and at colle? | 
age. The principal task was to recall amo? i 
43 comparison colors, arranged in a circu 
array, two particular hues, which were 95 i 
presented followed by a 5-second intervie 
Recognition performance was better for adui 
than for children and for younger hearing d 
Compared to younger deaf children. Com", i 
nication accuracy was also measured for f 

hearing and the older deaf groups, and Be 
tive correlations were computed betw 
recognition and communicability Both ari 
sroups were found to differ somewhat in th f 
pattern of recognition errors over the C? f 
Spectrum. From these data the authors E 3 
cluded that color categorization devé 


ee eoe in deaf children and 
Teognitio eem ults linguistic skill facilitates 
Ds n without basically modifying the 

| pi Patterning of colors. j 
in ES pilot studies to explore strategies 
E con memory were reported by 
Eu and Furth (1966b, pp. 19-25). The 
| oe consisted of. pointing out three 
tran e in a presentation of nine drawings 
ene ina3x3 matrix. The child’s re- 
Or aft E was to point—either immediately 
l lain. 10-second delay—to the same three 
ain BS when the nine drawings were shown 
“Stores * a rearranged order. Recognition 
Mewhat deaf children, ages 7 to 10, were 
Oreover below that of hearing children; 
ten Doin with increasing age, hearing chil- 
Eu tà the three drawings in correct 
ticular] order with greater accuracy, par- 
s E When familiar (hence labelable) ob- 
ded re involved. A final experiment mM- 
lenti à preexposure period to facilitate 
y le m n to the two-dimensional relation of 
lighter from smaller to bigger and from 
Cores = darker, Under these conditions the 
ĉaf chi] he the children, but especially the 
‘tence dren, improved, and deaf-hearing dif- 
lemon a became negligible. Deaf children 
jns mated inferior recognition perform- 
strates; ce they did not employ as effective 
duced. as did hearing children. The readily 
| at Anis of training seemed to indicate 
à deficit ferio recognition was not due to 
Bis tan intellectual structure but to à lack 
4s 26, (dan utilization of strategies. 
n Questio hrow (1968) addressed himself to 
hine also taken up by investigators 
ing before (Studies 6, 7, 8: 10), 
a > the relation of sequence learning an 


Suisti 
!C deficiency, He tested on recall o 


E E T ERE 


Hs 
S 
r my 


ildr three groups of 14 
en, each taught by different meth- 
rq N hearing children. Stimulus ma- 
Ett Soot 4 three classes of visual forms, in 
ae fa €creasing meaningfulness: silhou- 
Yangon miliar objects, geometric forms; and 


» M 
^nt E Number of items per sequence 
: r s 
| Were e tyg to six, exposure times for 


is or j& seconds, and rates of 
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presentation for SUC were one to four forms 
per second. Three main results were reported: 
(a) Number of items per trial, meaningful- 
ness, and exposure time afiected all groups 
in the expected direction. (b) The three deaf 
groups did not differ among themselves. (c) 
Hearing children were superior to deaf in re- 
call of SUC, but on SIM there was no differ- 
ence. For the deaf groups, mean SUC and 
SIM scores were similar; for the hearing 
groups, SUC was easier. In conclusion, some 
measurable difference in the deaf child's im- 
mediate memory for successive items was 
observed in the form of a slight retardation 
that could easily be attributed to lack of lin- 
guistic experience. 

27. Rozanova (1966) reported two experi- 
ments on the pictorial memory of deaf chil- 
dren. In Experiment 1, partial and whole 
pictures were first presented and had to be 
reproduced after 10 minutes. Younger deaf 
children (ages 9 to 10) were found to perform 
more poorly than hearing controls; the de- 
ficiency was more marked in the partial con- 
dition and disappeared with age, resulting in 
no differences at ages 15 to 16. In a second 
experiment, deaf and hearing children (ages 
8, 10, and 12) were shown 16 pictures of 
familiar objects and had to recognize them 
20 minutes later within a set of 80 cards in- 
cluding some similar “trick” cards. Younger 
age and deafness were found to affect memory 
d On the same task, af- 


performance adversely. s 
ter a further interval of 10 minutes, two or 


three cards at à time were shown to the child, 
including incorrectly chosen pictures. In this 
condition deaf children corrected their mis- 


takes as well as hearing children. 


y for f nine 
test of memory for a span of 1 
28e ), Ross (1969) serially 


inary bols (+; — ) 
ety ies (alternating sym- 
of a run of three 


5 : € 

a e] and doubles in random se- 
: unpatterned series 

quence) E: w different symbols 

composed . Deaf and hearing 

E cof tested (7 to 15) did 

children at yit A hoth groups 
1 


ts of pat- 
etter on thi a 
x In opposition to results with 
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digit Span, average span length was only 
about four Symbols for all unpatterned series, 
The similarity of results between deaf and 
hearing for all types of series challenges some 


tention of sequential visual information, 

29. Goetzinger and Huber (1964) matched 
30 hearing and 30 deaf youngsters on age 
on memory-for- 
design (Benton Visual Retention Test). On 
immediate reproduction both groups were 
similar, both being somewhat Worse with a 
5-second than with 10-second presentation. 
But on 10-second presentation, followed by 
15-second delay, hearing children experienced 
no decrement in their performance due to de- 
lay; deaf children, however, performed worse, 
Particularly when the delay condition came 
first. The authors Suggested that verbaliza- 
tion may have aided the hearing children in 


memorizing the design when a delay was in- 
terspersed between presentation and repro- 
duction, 


30. Tn order to examine order errors indic- 
ative of type of memory encoding, 
and Rush (1965) 
quences from a given list of 10 consonants 
for immediate written recall to a sample of 
deaf youngsters (N = 41, age range 13 to 
20). From a control group of 53 hearing 
seventh graders, a subsample of the worst 15 
was matched with a subsample of the best 
15 deaf Subjects in terms of similar mean 
memory performance (71.7% retained let- 


ing Subsamples, significant rank order cor- 
relations (.47 and -52) emerged between 
memory errors and acoustic listening errors 
established previously by Conrad. Deaf sub- 
jects also made some systematic errors, but 
these were neither acoustically nor spatially 
related to correct letters. This pilot study 
thus confirmed the acoustic encoding of let- 
ters for hearing subjects and its absence for 
deaf Subjects, with the mode of deaf encod- 
ing processes still unclarified. 

31. In a follow-up investigation (Conrad, 
1970) 36 deaf boys read, silently and aloud, 
five- or six-consonant sequences from a list 
of nine consonants With exposure time of 5 


| 

or 6 seconds, respectively, The task rea 

immediate written recall of each sequen 

On the basis of obtained confusion errors, $ 
subjects with typical articulatory errors si 
lar to hearing controls were classified in 

Group A, and the rest with errors perhap 
related to shape into Group non-A. RM 
of speech quality correlated with this M. 
fication. In contrast to hearing subjects va 
regularly benefit from reading out loud M 
silent reading, notably by a stronger d 
effect, Group A was unaffected by real E 

modes, and the Group non-A made even m " 
recall errors with reading out loud. A n 
experiment demonstrated that Group A of 
more difficulty in serial learning of a lista 
homophone word pairs than a list of palam 
words of similar shape; whereas Group non y 
learned both lists equally well. This st 
provided interesting evidence of aika 
coding types on the part of deaf eic, 
But even those deaf subjects who appeared " 
use auditory-articulatory encoding were D 
as uniformly or strongly affected by auditor? f 
related conditions as are hearing persons. 


Perceptual Tasks 1 


32. Oléron and Gumusyan (1964) E. 
ployed a perceptual task of embedded ligue j 
(Poppelreuter Test) and tested 28 deaf M 
28 hearing children at ages 4 to 6. The ch | 

q 


dren had to choose from a series of individu 
drawings those objects or geometric deste 
which they recognized in the composite PA 
ture of superimposed drawings, With a scoa 
ing system that rewarded correct choices a? 
penalized Wrong choices, 


s TÉ Se 

deaf children M 

consistently somewhat Poorer; however, on” 
i 


objects but also on g& 


nvironment, 
33. Yashkova (1 


on deaf children "s 


t 
6) published a repo? 
abil 


^ 
t 
r which names were na 
available to the hearing children), the D. 
thors suggested that linguistic skill may i 
S 
1 
Py to make rover? | 
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| ravings of 
Hes 9 to 1] 


geometric figures. Deaf children, 
were found to be as good as 


Carine 
children MOM, while at younger ages deaf 
vere somewhat retarded. Addition- 


| ally, t 
Ne a p 
; tuthor reported overall improvement 


i lack is 


Slowing i i 
of ic RM Ra in generalized methods 
iscovery ig In early trials, spontaneous 
“etved in he: a generalized method was ob- 
"af child aring children at 5 years; with 
3 i» ren, however, only at 8 years. 
3 to 10 pd notions of deaf children, Grades 
i PPM investigated by Gozova (1966). 
ideq a tasks young children were pro- 
Pitta Gene drawing of a three- 
Mone orm which they had to find 
E es nore of concrete objects. Older 
tawing uu to recognize an object by its 
'awine op dimensions and then make 
ge Ormance is The observed similarity of 
Spatial tiis on the first task suggested that 
“dren wi are normally developed in deaf 
Second t the observed retardation on 
pe indicated to the author a 
of old ion and detailed thinking on the 
"AMET deaf children. 

chting an extension of earlier work by 
Uchman (1960) on color-form attitude, 
BY 36: 4, (1966) replicated Doehring's test 
Mean a aes and 36 deaf children with a 
the es of 10. She observed that 86% of 
de é nee children and only 33% of the 
he a Preferred form, while only 1496 
e Pade a and 67% of deaf children 
i iane or. Subsequently these same chil- 
i ested a similar difference In dis- 


Dart 


Sti ni s 
higher ve accuracy: hearing children had 
chi en fy scores for form than deaf 
qun ee conversely, deaf were better 
© childr Nora" 
ine? third b en for color discr imination. 
k, discrimination learning, hear- 


3 exe P 
elled over deaf children on the form 


les 

Ais 
Fours aile on the color learning task both 
Suc. Were equal. On the basis of these data, 


children viewed the color preference of deaf 
ar an acquired habit of categoriza- 
han Ps more adequate vis-à-vis color 
Me form cues. 
telateq arrier (1961) investigated a question 
Debe. to the one asked by Lantz and Len- 
e (e (1966) as well as by Suchman (1966). 
9und that hearing children over a wide 


lion 
Cues t 
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age range (6-18 years) showed a significant 
tendency to judge a black cube heavier than 
a white cube of similar size and material. 
The converse, that white is judged lighter in 
weight than black, was also observed, al- 
though to a lesser degree. In a comparison 
of other colors, yellow was reacted to like 
white, while blue and red elicited no sys- 

action. The experimental 


tematic weight re 
procedure was essentially nonverbal and in- 
a session with a balance and 


cluded a training 
weights. Deaf children over age 12 performed 


consistently similar to controls, but younger 
deaf children, below age 10, failed to mani- 
fest systematic color-weight associations. Sub- 
sequently, C arrier observed that hearing 
children rated color words in a weight hier- 
archy with black as heavy and white as light. 
The author suggested on the basis of these 
results that color-weight associations are pri- 
marily mediated by general linguistic experi- 
ence rather than by the arbitrary fact that 
in English the word “light” combines the two 


elements of the weight-color association in 


one word. 

37. Costa, Rapin, 
vestigated reaction t 
and paired (visual an 
in a group of 30 chi 
tion disorders and 2 


and Mandel (1964) in- 
ime to visual, auditory, 
d auditory) stimulation 
Idren with communica- 
8 control children of 
age 8 years. It was established that 
time was similar in both 
and that positive reinforcement im- 
a successive days. For 
n time was shorter 


mean 
visual reaction 
groups 
proved performance 0! 
controls, auditory reactio! 
than visual, and paired reaction time was 
still shorter than auditory reaction time. In 
deaf children, however, paired stimulation 
had no facilitating effect. 

38. It has long been held that with deaf 
persons who habitually use manual means of 
communication one can readily observe finger 
electromyography EMG) movement, analo- 
gous to something like internal verbalization. 
5 eye movements (REMs) have 


Moreover, rapid : ; 
ound associated with dream- 


recently been f y 
ing. Consequently Stoyva (1965) investigated 


whether in deaf adults REM during sleep 
would be related to EMG to a significant de- 
gree. On the basis of two experiments during 


sleep, one with seven deaf adults who habit- 


Wet, 


> d 


ine 


ually use manual finger spelling and the other 
With six deaf and six hearing controls, the 
following results were established. In deaf 
bersons REM, but not EMG, was associated 
with dreaming; EMG activity during REM 
periods in deaf was similar to that in hearing 
adults, and during non-REM periods there was 
a high degree of overlap between deaf and 
hearing by means of 
modern electrical techniques seem to chal- 
lenge the former view that linked finger EMG 
in deaf persons, 


later. 
Conclusions 

The original Summary article (Furth, 
1964b) was criticized by Blank (1965). She 


urth presented inadequate 
g that thinking can develop 
of language and pointed 


a more comprehensi 
from a theoretica] angle 
patterned after Piaget’ 
(Furth, 1966c). 
It would be a 
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children were poor in discovery of logic 
symbols but equal in the intelligent use p 
symbols that were shown to them. Yet Study 
14 reported a significant developmental delay 
in symbol use for deaf subjects compared t0 
middle-class hearing controls; at the same 


: 5, 
ume, results for the deaf were the same 4 


those for a sroup of Indian adolescents. 
Moreover, a haunting question always Te 
mains in cases where deaf children eventually 
do catch up: Would a more stringent p 
terion of success or a more complex task yiel 
differences? An analogous question can D. 
raised with respect to all tasks on which sig 
nificant 
training raise the performance of deaf sub- 
jects to a level equal to that of hearing con 
trols, and what duration and intensity k 
training would be required? In order to at 
rive at a more conclusive answer to thes’ 
questions this investigator and his associates 


: , e 
failures were observed: Would som 


are presently engaged in a detailed study ° 


Selected deaf adolescents which includes 
short-term and long-term training with indi 
vidual observations and uses hearing contro 
groups from different social environments. 
Before attempting to Suggest some tent 
tive theoretical conclusions, it may be ap” 
Propriate to restate in condensed form the 
basis on which the cited empirical studies 
with deaf subjects are evidence for these con* 
clusions, 1 
The assertion that deaf persons are lin 
guistically deficient does not mean that they 
are lacking in general symbolic ability or that 
they are ignorant of any and all English 


y 
words. Most deaf persons in our society know 
some English wo 


of the language, 
fitted to form meaningful s 


rds or phrases; but admit- | 


oon, the fact is overlooked and 
jority E a surprise to many that the ma- 
Spend POP born deaf, even though they 
| Sngle-mi to 15 years in schools with almost 
D: not raeg emphasis on language learning, 
EM oe comperare i he natural lan- 
_ teasons Ne society. For understandable 
With th is fact is usually not broadcast, and, 
Which a exception of one normative study 
Only — comparisons with hearing norms, 
E os Ding statements to this effect by 
he tit S of the deaf can be gleaned from 
erature, 

Eu. see differential measure of lin- 
tional mene does not exist. An addi- 
is t ak o iculty with profoundly deaf persons 
reading & comfortable skill in speech or lip 
Sa 8 is an exceedingly rare achievement. 
of a hable compromise, comprehension 
Nt star p reading as measured by pres- 
ADpropri, ardized tests can be suggested as an 
fade riate criterion. Reading tests below 
et sample only fragmentary aspects of 
ge, and there should be general agree- 

t that knowing a few words and isolated 
| Peteng does not constitute linguistic com- 
fo Ws Significantly, the standardizing study 
| (1963 "I&htstone, Aronow, and Moskowitz 
à , based on an imposing norming total 
ang 524 deaf pupils between the ages 104 
3, demonstrated strikingly that the av- 
Sto. reading achievement of deaf pupils 
S short at Grade 3. On the Metropolitan 
gro entary Reading Test the youngest age 
— 654) had a mean 

ve the chance 
e standard de- 
s of na- 
that norms this indicates (Furth, 1966b) 
in the mean score over a span of 5 years 


fed from a grade equivalent of —— 
9f 2 ^i 3.6, never reaching the mean aec 
| ever "Ihe norm for beginning rade 4. ? ae 
Í aboy, "ardly 10% of the total sample sco al 
Ons; Grade 4, a reading Jevel that expert 
It “ider to fal] within “functional literacy.” 
for, 2ds to reason that the poor reading is 
Tele ance of deaf school children cannot y 
dicsted to any of the usual types of rea E 
to culty. A slow reader is rightfully assumed 
* quite competent in the language 
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which he has reading difficulties. Not so with 
deaf persons. In their case, reading level is 
almost uniquely determined by competence in 
language and therefore is diagnostic of that 
competence. 

Here, then, is quantified evidence that only 
a small percentage of deaf persons are at all 
comparable to hearing persons in being “at 
home” in the conventional language of the 
society. In other words, apart from a few 
notable exceptions, the difference in linguis- 
tic skill between persons born profoundly 
deaf and hearing persons is not merely statis- 
tically significant, but it is as great as the 
difference in level of hearing. At one extreme, 
there are hearing youngsters steeped from 
earliest childhood in the conventional lan- 


at the other extreme, there are deaf 


guage; 
youngsters who during the early formative 


years of their life have no ready-made symbol 
system available. The conventional gestural 
sign language in use among deaf adults is 
only available in childhood to those deaf chil- 
dren who are born of deaf parents—a very 
small percentage of the deaf population. 
Hence, it is literally true that most deaf chil- 
dren grow up during the early formative pe- 
riod of life without any systematized conven- 
tional language. The significant fact is that 
in spite of not having been provided with a 
conventional symbol structure, deaf children 
construct their own symbols as they are 
needed for the development of thinking. On 
this crucial fact rests the present writer's in- 
terpretation of the reviewed studies. 

Another important point for a theoretical 
interpretation is understanding what can and 
what cannot be demonstrated by means of a 
deficiency experiment. If an organism without 
a certain factor fails on a given task, one can- 
not conclude that the factor is directly re- 
lated to the task since other uncontrolled in- 
fluences may be at work. But if he succeeds 
on a given task, one can infer conclusively 
that the missing factor is not a prerequisite 
for the task. As a consequence, those experi- 
ments where deaf subjects are somewhat re- 
tarded produce a far less conclusive inter- 
pretation than experiments where they are 
equivalent. 


The apparently well-founded objection that 
the proposed interpretation regards deaf sub- 
jects as forming a single homogeneous group 
in linguistic deficiency does not quite touch 
the core of the argument, Unquestionably, in 
studying deaf children, account must be taken 
of wide individual variations in degree and 
type of hearing loss and in psychological ad- 
justment to the loss. However, the argument 
presently proposed is not based on a disre- 
gard of individual differences but on the fac- 
tor of linguistic deficiency that is common to 
the vast majority of individuals within the 
deaf sample, Actually some of the individual 
differences, such as possible neurological 
lesions associated with hearing impairment, 


would be working against the intended null 
hypothesis, 


Another objection to the 


tation points to several in 
pearance at a la g i 


present interpre- 


A possible explanation is that older deaf chil- 
dren are linguistically more competent than 
Younger deaf children, Such an explanation 
would be more plausible if the linguistic in- 
crease were truly substantial. However, it 
was shown that even at a near adult age most 
deaf persons are still woefully lacking in lin- 
guistic competence, Whereas the observed 
data are clearly consistent with the postula- 
tion of a certain developmental lag for deaf 
in comparison to hearing children, to suggest 
that the catching up is due to a slight increase 
in linguistic competence is inadequate to ex- 
plain substantial increases that take place in 
a number of quite diverse tasks. Rather, most 
aspects of deaf children’s time lag can be 
More parsimoniously attributed to an easily 
demonstrable restricted past experience and 
training, not unlike the situation that pre- 
vails for culturally different hearing children, 

During the Moscow International Congress 
of Psychology, O’Connor (1966) and Furth 
(1966d) Presented summaries of recent work 
with deaf Subjects. O'Connor reported on a 
number of studies intended to demonstrate 
the importance of speech for thinking. Yet, he 
concluded, the recent "findings . . . force on 
me the somewhat unwelcome conclusion as 
far as I am concerned that whatever effect 
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auditory deprivation may have on speech, ! 
cannot be said to have a strong effect on re 
soning capacity [p. 38].” 

The recent work in deafness has produ 
further confirmation of the theoretical ee, 
cations presented before. The data conti 
that at least up to and including the | 
operational stage linguistic support is not e 
quired. Concerning the relation of langun 
to formal operational thinking, data are $ i 
sketchy, and firm conclusions are hard á 
draw, if only because these operations are n r 
easily tested even in the adult hearing poy 
lation. However, the general conclusion seo 
abundantly clear that thinking processes. 4 
deaf children are similar to those of hearin? 
children and therefore must be explains 
without recourse to verbal processes. This E 
teresting fact implies that the current emp. m 
sis in psychology that attributes the pon 
of intelligence mainly to verbal factors is Y. 
questionably inadequate and that undue ^ 
phasis on linguistic skill, particularly in ed 
education, may not be a sound basis n 
nourishing intellectual development qu 
1970). Insofar as Piaget (Inhelder & Pio 
1964, p. 293) stated categorically that p. | 
guage is not a constitutive element of We 
thinking, it is intriguing to consider that E 
evidence collected in this review posi 
strong empirical Support as can be reasona? ^ 
expected for his theory, As a general aP 
proach, nonverbal methods may be extre 
fruitful for an objective study of think | 
and for an adequate appraisal of the functi? 4 
of the conventional language and other sy” 
bols in the development of thinking. 
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measurement are reviewed and 
heart rate and skin resistance, 
treatment seems a rel 
components of fe: 
different from those of 


decrement with repetition, 
ins unc 


| All available studies concerning 


this decrement rema 
other behavioral changes i 
uted to varying rates of. c 
counts of the mechanisms in 
the evidence reviewed. 


The ; 
th Sue DUM of autonomic responses in 
Q0blas is Sx maintenance of fears and 
gite Our requently taken for granted, de- 
Nature of continuing ignorance of the precise 
logical i the relationship between the physio- 
Current nd psychological components of fear. 
ay, opinion based on what evidence 1S 


atively effective method of modif; 
ar. The reported effects of phobic 
neutral imagery 
although the role played by 
lear. Partial dissociation bety 
n treatment appe: 
hange in the differen 
volved in desensitiza 


` any of the differen 


desensitization and utilizing psychophysiological 
evaluated. Autonomic changes. particularly in 
do occur following desensitization, and this 


ing such physiological 
imagery are consistently 
and show a characteristic. response 
relaxation in facilitating 
veen physiological and 
ars frequently, and may be attrib- 
t systems. Some theoretical ac- 
tion are discussed in light of 


ed from different measuring devices, 
ated to some degree, show a 
independence from one an- 
other. As Lang has pointed out, there are no 
a priori reasons why we should assume that 
rent attributes of fearful be- 
havior exclusively define the presence or ab- 


sence of fear—rather we should assess as 
ble and take them 


e greater the degree of 
the greater the 
of our labeling the 


fear deriv 
although rel 
marked degree of 


lso be applied to processes 
he elimination of fearful 


ai 
table appears to be converging on the view 
fully ear, normal or abnormal, may most use- many components as possi 
Kimea as a label referring to the all into account. Th A 
Bical . us occurrence of responses, physio- relationship between them, 
Perfect psychological, in a series of im- likelihood, and usefulness, 
Ver] al Y coupled systems, for instance, the observed state “fear.” 
log: /cognitive, behavioral/motor, and phys- This view may a 

1967; directed towards t 


Bica : 
al/autonomic systems (Lacey; 


ang 

Wis 695 Lazarus, 1968). 
Med $ the nature of the interactions in- 
me it not entirely agreed upon, jt has be- 
, orcibly apparent that assessments of 

Th 
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as FANRAN United Kingdom. An extended version 
Ments aes in partial fulfillment of the require- 
* Re or a PhD degree at the University of London. 
M, Quests for reprints should be sent to Andrew 
logical athews, Department of Psychiatry, Psycho- 
bital, Treatment Research Unit, Warneford Hos- 


University of Oxford, Oxford, England. 


© 1971 by the American Psychological Association, 


73 


behavior with potential profit. While psycho- 
clined to direct their atten- 


therapists are in 

tions exclusively to modification via verbal ex- 

pressions of emotion, and may thus influence 

other systems indirectly (e.g. Lacey, 1959), 
are characterized by a 


behavior therapists 
readiness to directly attempt the induction-of 
behavioral and autonomic changes as well. 


While this is true of many procedures 
called behavior therapy, such as aversion, only 
systematic desensitization treatment is consid- 
ered here, partly because of the more wide- 


Inc. 
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Spread clinical use and application of this 
technique, and partly because the treatment 
Was devised and formulated with the modifi- 
Cation of behavioral and autonomic fear 


an "index of anxiety,” 
The motive fo 
measure May sim 


used are often unstandardized in response to 
the demands Of treatment in the individual 
patient, 

Nevertheless, 


these studies may have some 


distinct advantages; they remain close to the 
clinical treatment si 
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2. In controlled studies contrasting desenst- | 
tization with alternative treatment P. 
dures, it seems appropriate to utilize a w 3 
variety OÍ assessment methods, including 
psychophysiolosical measures, In fact, aston- 
ishingly few trials have utilized such meas 
despite the practical and theoretical m 
tance of changes in the autonomic responso 
involved in phobic and psychosomatic dis 
orders. One possible reason for this is that ) 
be used appropriately as an assessment s 
treatment-produced change, physiological i 
Sponse should be measured in the presence E 

an appropriate phobic stimulus, a condusi 

not easily reconciled with the demands 0 


* 3 u- 
polygraphic recording. One of the best sol 
tions to this n 


(1966) in usi 
immediately b 


pated phobic Situation, an approach which 
turned out to be a Productive one m 

3. The decade that followed Wolpe's com 
prehensive 


Investigations 
Sensitization ha: 
Strategy of anal 


desensitization 
relaxation and 
graded phobic image. 
not bear directly on 
above. To do this it 
design an experiment 


Most effective whe? 
Presentation a 
mbined, they | 


YPotheses outline 
Would p 


ide 3 
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i 5 : 
Miss physiological and psychological 
entations um foe and noncontiguous pres- 
While such p hobic stimuli and relaxation. 
ing, Ais $e Lee sig experiment is still lack- 
| een ES. een an increasing interest in 
Using eel iori. vt oi these hypotheses 
Picton, ae eda So recordings during 
Simi Ea the imagination of phobic 
Diesen : — "d a better evaluation of the 
theory, atus of the reciprocal inhibition 
gen a relevance to the postulated under- 
andful M Mr of desensitization are a 
eld to ie one studies in which a process, 
action te T e desensitization in its mode of 
Psychoph rate pla and assessed using 
studies Voie ogical measures. Such analogue 
about en of course, prove anything 
ment ong happens in desensitization treat- 
al eds they do indicate that a number of 
& Banane. models are available which may 
Or exam describe the treatment process; 
habituatio e, reciprocal inhibition, extinction/ 
n, and cognitive rehearsal models. 


Un 

CON " 

i oe Case Stupies oF PATIENTS 
óRGOING DESENSITIZATION TREATMENT 


he classified. under this heading are 
nly y characterized by the application of 
sistance. autonomic variable, mainly, skin re- 
Choice e (galvanic skin response—GSR). This 
Availabifit" be determined both by the ready 
ides ility of portable equipment. and by 
TOR assumption that skin resistance is 
respo or to all other measures 1n sensitivity of 
Asun to emotion-provoking stimuli. This 
an is frequently made despite the 
auton erable work on relative specificities 1n 
end i responding (Lacey, 1967) and the 
Ships t criticisms of expecting simple relation- 
vari between physiological and psychological 
eg referred to earlier. 
Eois case studies of this type have been 
Pai ed by Meyer (1957), Clark (1963), and 
"lass ^ all of whom assume that the 
item na or absence of a GSR to an imaginal 
En evidence of the degree of ex- 
(1966). eie Similarly, Hoenig and Reed 
Vine? pes i and Brown (1967), and Gale, 
reports be Ayer (1970) gave unsystematic 
Esine the clinical use of skin resistance 1n 
sitization. While all these studies seem 
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based on the same assumption, Hoenig and 
Reed expressed some skepticism based on the 
apparent dissociation observed in individual 
patients between GSRs to phobic words, 
images, or real objects, and verbal report. 
Despite this, general agreement seems to exist 
that a steady increase in skin resistance ac- 
companies relaxation, while the responses to 
imaginal items decrease with repetition. 

A somewhat more systematic approach was 
used by McAllister (1966)? who found that in 
some patients the amplitude of the GSR to 
phobic images was significantly greater than 
that to neutral images before desensitization, 
but that there was no difference between im- 
ages at the end of treatment due to the usual 
decrement in response to phobic images. 

The most informative of the uncontrolled 
studies have been reported by Agras (1965, 
1967). In the earlier report, Agras described 
the spontaneous recovery of subjective anxi- 
ety to imaginal items following desensitiza- 
tion sessions in six patients said to have multi- 
ple phobias. Fifty percent of the desensitized 
items showed some degree of recovery in sub- 
jective anxiety when represented in a subse- 
quent session, and this effect was paralleled 
by a similar recovery in skin resistance re- 
sponse to those items. Tn the later study, Agras 


reported the relationship between progress 
through the imaginal hierarchy, skin resis- 
test items and progress in 


tance response to 
real life, in five agoraphobic patients. The 


relevant measures of treatment progress were 
derived from a cumulative record of imaginal 
items completed, that is, imagined without 
reported anxiety, and a detailed self-report 


elicited from the patient of all traveling that 


was successfully carried out between treatment 
sessions. Prior to specified treatment sessions, 


five preselected items spread over the pa- 
tient’s hierarchy were presented in imagina- 
tion, and the mean GSR amplitude calculated. 
Graphic presentation of the obtained data sup- 
ports the author’s claim that cumulative prog- 
ress through the hierarchy was closely corre- 
lated with diminished GSRs to imaginal test 
items in three of the five patients, while 
progress in real-life practice tended to lag 
3 McAllister, H. The physiological correlates of 
anxiety. Paper presented at the meeting of the British 
Psychological Society, London, December 1966. 
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behind. In the other two patients, while hier- 
archy progress still 
tice, skin resistance 
changed last of all. 

Wolpe and Flood (1970) reported a 
study of the GSRs to imaginal items at dií- 
ferent hierarchy levels 
in relaxation, and in fiv 
such training, W 
group was teste 
this section, as n 
tistical analysis. 


preceded change in prac- 
responses to the test items 


are pre- 
show a fairly clear 


ever, while responses 
Sroup tended to be greater 


although it is unclear whether 
as a simple habitua- 
or attributed to the 
action of reciprocal inhibition. While Most 
application of skin 
resistance measures clinically useful, a number 
of cases Were reported in which an apparent 
dissociation occurred between skin resistance 
Tesponses and other behavioral changes, thus 
rendering the general clinical usefulness of the 
GSR as a single measure extremely doubtful. 


DESENSITIZATION TRIALS UsiNc 
PHysIoLocIcar ASSESSMENT 


This section considers those controlled stud- 
ies which have the Primary aim of establish- 
ing the effectiveness of some variant of the 
desensitization procedure, but which have in- 
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corporated physiological measurement as part 
of an assessment battery. 

Twelve asthmatic patients, 6 adults and 6 
children, were treated by Moore (1965) ina 
controlled comparison of desensitization, aa 
laxation plus direct Suggestion, and relaxation 
alone. Assessment methods included a seli- 
report of the number of attacks and estima- 
tions of respiratory maximum peak flow at 
the beginning and end of each session. Mean 
change in the number of self-reported asth- 
matic attacks tended to favor desensitization, 
but all treatments Produced subjective im- 
provement, and no significant differences were 
found on analysis of variance, Change m 
maximum peak flow, however, showed little 
effect due to treatments other than desensiti- 
zation, which produced a marked improve- 
ment significantly greater than relaxation Of 
Suggestion. d 

The most thoroughly controlled trial ™ 
which Physiological measures were used tO 
assess outcome is that of Paul (1966). Su 
dents with fears of public speaking were allo 
cated to desensitization, psychotherapy, atten- 
tion/placebo, or no-treatment groups, d 
assessed before and after the treatment using 
self-report, behavioral, and autonomic ls. 
Sures. Each subject was required to give p 
short speech before a small audience, puls 
rate and palmar sweat being measured i 
seconds and 30 seconds, respectively, prior 


iens 
the speech, Self-ratings and behavioral sig” 
of stress both respond 


nd in self-ratings 0 
mely low and non 
Physiological as 

assessment wé 
related to the q ehaviora] clianzel 


€gree of 
however, and this latter measure Correlate 


1 


| 
| 
| 


no 
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29 with pulse rate change, .25 with palmar 
E. change, and .35 with a composite physi- 
Ogical score derived from both measures. 
foie from these results that desensi- 
activit ees significantly reduce physiological 
h ly in subjects anticipating à real-life 
ee auntie that of speaking in public, 
Consist te effect was both less marked and less 
dition eat tolong other treatments. In ad- 
ae association, although a low one, was 
E etween the extent of this physiological 
Derforman the improvement in the subject's 
kin ance under phobic stress. 

tion of resistance levels during the pre: 
Used b has phobic stimuli (snakes) were 
Measure giis and Edwards (1967) as ape 
Sensitizati outcome in a comparison of A 
retest . fee, with and without relaxation. n 
rg after treatment. both groups showed a 
and endum in skin resistance response; 
Ments ere was no difference between tak 
and E this measure, although behaviora 
E o jective changes showed a significant 
Both a for desensitization with relaxation. 
and p, Zeisset (1968) and Leitenberg, Agras, 
arlow (1969) recorded skin resistance 


Tes : 
Ponses during treatment sessions and found 
outcome of desen- 


of elicited GSRs. 
some item 
xiety in 
tenberg 
anxiety 
dly than 


senta- 


iig ee tonshlp between the 
n 5 lon and the frequency 
t Menting on the finding that 
Sentations elicited a report of an 
* absence of a detectable GSR, Lei 
bes suggested that physiological 
nses might extinguish more rap! 
med report of anxiety. 
lma tou, ini the GSR to 
Aginary phobic situations before and after 


esensitizat: a j 
ensitization were studied by Barlow, Lei- 


“berg, Agras, and Wincze (1969). Follow- 
GSRs to phobic 


ing ; 
ing, Imaginal desensitization, t 
9 oed declined significantly, while responses 
Vivo s] real situation showed little change. In 
Nsom on the other hand, pro- 
est significant reductions In GSR to both 
ipw ilb, suggesting good transfer back 
poke to imagined scenes, but. little or no 
d generalization from the imagined to 
lel situation. The same workers have 
ed a comparison between a “shaping” 
ppo cedure in the presence of the real phobic 
ject, and imaginal desensitization (Barlow, 


both real and 


Agras, Leitenberg, & Wincze, 1970) with 
very similar findings. 

A relatively novel psychophysiological mea- 
sure, the penile plethysmograph, was used by 
Bancroft (in press) to contrast the effects of 
desensitization and aversion therapy in the 
treatment of homosexual patients. In general 
both methods produced a very similar amount 
oi change as determined by a series of atti- 
tudinal and behavioral measures, and both 
treatments were associated with significant 
increases in erectile response to heterosexual 
ight be expected in view of the 
results reported earlier, the extent of erectile 
change did not correlate significantly with 
improvement measured in other ways. 

In a comparison between desensitization 
and flooding (implosion) treatment, Boulou- 
gouris, Marks, and Marset (in press) recorded 
skin resistance and heart rate responses to the 
description and imagination of phobic scenes 
before and after each treatment. The results 
owed significant superiority for flooding over 
rms of ratings made by 
the therapist and by an independent assessor, 


and in terms of the diminished physiological 


responses to the imagined test items. Changes 
to the fantasy 


in the physiological response 
test items were generally insignificant after 
six sessions of desensitization in these pa- 
tients, but six sessions of flooding produced 
significant reductions in both heart rate and 
skin conductance response during phobic fan- 


pictures. As m 


sh 
desensitization in te 


tasy. 

In general, it is possible to arrive at some- 
what more definite conclusions from the sur- 
vey of controlled studies than from the previ- 
section. Moore's study alone indicates the 
for including appropriate physio- 
ements in the assessment of the 
effects of desensitization especially where 
somatic stress responses are prominent fea- 
tures. In Moore’s and in Paul’s study, it was 
convincingly demonstrated that desensitization 
was responsible for a significant diminution in 
the physiological response to stress, and that 
other procedures involving relaxation, sug- 
gestion, and a therapeutic relationship, tended 
to produce mainly subjective improvements 
and relatively little physiological change. Tt 
s very likely that while many types of 
al procedures can influence verbal 


ous sec 
necessity 
logical measur 


seem: 
psychologic 
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behavior, desensitization alone of those men- 
tioned, is equally effective in reducing the 
autonomic components of fear. However, it 
may be that other procedures, such as flood- 
more effective in re. 
phobic stimuli. 


subjective 
treatment. Although al] Sys- 
same direction, 
little correspondence between 


other autonomic responses, 


although Seneralization in 
the reverse direction may occur (Barlow et 
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Analysis of dese 
ponent parts has p 
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SITIZATION 


c compo- 
nents such as suggestion, positive expectation, 


and encouragement from the therapist (e.g., 
Leitenberg et al, 1969) are not included, 
since these have been discussed in detail by 
Wilkins (in press). 
Relaxation 


Three methods of training in relaxation are 


commonly used in desensitization, Jacobson’s 
method of Progressive relaxation (Jacobson, 
1938), the direct induction of relaxation by 
hypnosis, and autogenic training (Schultz & 
Luthe, 1959). Autogenic training has been 
used less frequently in combination with de- 
Sensitization, but has been described by Kon- 


7 


das (1967) and Moore (1965), and ee 
to produce satisfactory results. It is there E 
assumed here that the active agent canti 
ing to the importance of relaxation in desen 
tization is common to all three methods, rH 
for this reason studies examining the physi 
logical effects of each are described. ‘cally 
The physiological effects of hypnot ni 
induced relaxation have been extensiv d 
studied, but very infrequently in the form E 
controlled experiments where the d 
hypnotic Suggestion are contrasted with 96 
induced relaxation, Barber and Hahn (1 duh 
reported finding no physiological effects tion, 
ing initial hypnotic suggestions of epe. 
Which were not also seen in a control g 
simply asked to sit quietly. n auto" 
Repeated use of autosuggestion, as in e " 
genic training, has usually been WE E. 
the form of uncontrolled case studies zi 
Schultz & Luthe, 1959), but Geissman with 
Noel (1961) found a number of physio E 
changes characteristic of autogenic relaxa di 
that were not present in subjects at a 
These included decreased respiration afl 
absence of GSRs, and electroencephalog!?". 
(EEG) with predominating alpha Hiythm, 5 
occasional theta discharges. These gu 
would seem reliable in that they have E 
replicated by severa] other workers feo dat 
& Jus, 1969), and it is perhaps surprising nat 
More rigorously controlled studies have 
been performed. 


s " the 
The most commonly cited evidence for th! 
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k A pe 
physiological effects of relaxation in w-— 
tization derives from the work of Jacobs© 


rapist or 
testing environment, as did the trained pa- 
tients. It is quite Possible to accoy 


Nt for all 


3 


a 


RA 
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the " 
ae; results in terms of either adap- 
n effects or the spontaneous remission of 


| 5 
ymptoms over time. 


E ia controlled study of brief muscu- 
carried rence as used in desensitization was 
E is m. y Grossberg (1965). Thirty male 
ot ios. divided into three groups, one 

received recorded instructions in 


P. relax P x 1 
xation for two 40-minute periods, approxi- 


fies”, E days apart, while one control group 
lected oh same time by listening to self-se- 
self-rela “axing music, and the other attempted 
ieee without further instructions. 

nyograms from the forehead and fore- 


> atm 
Were recorded throughout, as was skin 


Tesis i 
oe and heart rate. No differences 1n 
al level were found among the groups in 


| an 
|. "lY of the measures used, indicating that the 


Telaxas; 
era training procedure did not have any 
ed physiological effect. 
rather similar study was reported by 
Were Que in which 60 female students 
Whether [o into three groups according to 
relaxation ney received training in muscular 
Self-relax » hypnotic relaxation, Or attempted 
Em without instruction. Electro- 
Piration : rom the forearm, heart rate, res- 
Sureq ate, and skin resistance were mea- 
Bio, ina through the two j-hour 
Broup s. Compared with the self-relaxation 
ion a the group trained in muscular relaxa- 
Ce found to have a significantly greater 
essione i of physiological activity over the 
S in all measures except skin resistance, 

€re no significant differences were obtained, 


` ang 
Nd the trend of the results was in the oppo 


Site direction, 
m. difficult to reconcile th feren 
View S of Paul’s and Grossberg’s studies in 
in Fes the rather similar procedures used; 
etw, dition, there seems some contradiction 
A T the results of hypnosis obtained by 
(1963 (1969c) and by Barber and Hahn 
ings ^ Further negative results, obtained 
Eie Ub from more recent studies in which 
axati nts have received similar training in re- 
Sion Fi tend to support Grossberg's conclu- 
TER hat very brief training 1s only slightly, 
all, effective in reducing the level of auto- 
hen compared with the rest- 


e very different 


Noms 
Mic activity w 
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ing state (Lehrer, 1969;* Van Egeren 
Feather, & Hein, in press). : 

A somewhat different procedure was 
adopted by Mathews and Gelder (1969) ina 
study of the effect of relaxation training in 
phobic patients. Fourteen patients were given 
two training sessions derived from Jacobson's 
abbreviated technique (1938) and were then 
tested during a further relaxation session and 
a control period, while forehead EMG, heart 
rate, respiration rate, and skin resistance were 
continuously recorded. Significant reductions 
in skin conductance level and fluctuation rate 
were found during relaxation as compared 
with the control condition. Electromyograms 
remained consistently lower throughout the 
relaxation session, and both heart rate and 
respiration rate tended to drop more during 
relaxation, but this effect did not achieve an 
acceptable level of significance. All measures, 
especially skin conductance fluctuations, pro- 
duced significant correlations with subjec- 
tively estimated depth of relaxation, with the 
exception of EMG which showed no such re- 


lation. 
The 
previous studies, 


main differences between this and 
apart from the use of psy- 
chiatric patients, is that the subjects were 
relatively better practiced and familiar with 
relaxation, so that the active muscular exer- 
cises used earlier could be discarded, and the 
procedure used was similar to that employed 
in a standard desensitization session. The 
most striking contrast between the results of 
this study and those of Grossberg and Paul, 
that is, the skin resistance findings, may most 
probably be attributed to this difference in 
procedure. ; ' 

It is difficult to summarize the available 


knowledge concerning the effects of brief re- 
and the contradictions in 


laxation training, r 
the results suggest the operation of at least 
some factors not yet fully understood. Only 
Paul has produced any evidence that the par- 
ticular procedure adopted had any specific 

the method of training, inde- 


effect related to 
pendent of the general effects of adaptation 


_ A laboratory analog of desensitiza- 


3 Lehrer, P- M. 
effects of relaxa 


hysiological 


tion: Psychop : 
Paper presented at the meeting of the Society for 
logical Research, Monterey, California, 


Psychophysio 
October 1969. 
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and the reduction of external stimulation. It 
seems reasonable to suggest that several spe- 
cific procedures may have the effect of reduc- 
ing autonomic and muscular activity, but that 
whether these effects are observed, and in 
which measure, depends on details of pro- 
cedure and timing, the length of training, and 
the nature of the comparison control pro- 
cedure. 

If the results obtained by Paul are accepted 
in contrast to those of Grossberg, the use of 
active exercises in training may have the 
effect of inducing transient periods in which 
muscular, cardiac, and respiratory activity is 
minimal. Relaxation as used in actual desensi- 
tization, however, may be associated with a 
marked diminution in skin resistance activity, 
which is correlated with sensations of subjec- 
tive relaxation and calm. Further periods of 
practice, as are employed in autogenic train- 
ing, may be associated with even more pro- 
found changes, observable in the EEG record. 


Phobic Stimuli 


The relationship of autonomic responding 
to the intensity of experienced emotion, in 
particular anxiety, has been the source of a 
long continued controversy. A review by Mc- 
Curdy (1950) is commonly 
of the use of the amplitude 
index of emotional intensi 
relevant studies found by 
positive, and usually hig 
tween GSR and judged i 

Although the studies r, 
generally refer to relatively physical stimuli, 
such as emotional words or electric shocks, it 
is fairly well-established that the response to 
imaginal or vicariously experienced stimuli 
produce results similar to directly experienced 
stimuli (Barber, 1965: Craig, 1968; Miller, 
1951). 

Lomont and Edwards 
& al. 


quoted in support 
of the GSR as an 
ty, since all the 
McCurdy reported 
h, correlations þe- 
ntensity, 


eviewed by McCurdy 


(1967) and Barlow 
(1969, 1970) have reported the only 
controlled studies which included physiologi- 
cal measurement of the response to real-life 
phobic stimuli, in this case, the GSR in snake 
phobic subjects to presentations of a snake at 
varying distances, Since no control stimuli 
were presented at the time, the 


only conclu- 
Sion possible from these Studies 


is that the 
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GSR to phobic stimuli declines following de- 
sensitization or exposure to the real object. 
An alternative method of examining the 
physiological effects of phobic stimuli is t0 
compare responses to viewing pictures of neu- 
tral and fear-evoking objects. Geer (1966) 
assessed the change in skin resistance cause 
by viewing pictures of spiders and pictures 0 
snakes, in subjects who had expressed a fear 
of spiders only. Subjects reporting an intense 
fear had larger responses to pictures of spiders 
than those who reported mild fear, while simi- 
lar groups did not differ in responding less t0 
pictures of snakes. The difference between the 
first group and the other three was highly 
significant on first showing of the spider slide 
($ = < .02), less so on second showing (p= 
< 05), and just failed to reach significance 
on the third and final presentation. While 
clear-cut differences in the predicted direction 
were found, it is of interest that habituation 
effects rapidly and effectively reduced thes® 
differences to the point of nonsignificance. i 
Studies essentially similar to that of Gee! 
have been reported by Wilson (1966, 1967): 
In the first of these experiments Wilson 5 
tablished that skin resistance responses to # 
series of slides, including pictures of land- 
scapes, snakes, and spiders were not signifi- 
cantly effected by the picture content when 
shown to subjects presumed to be without 
abnormal fears. Two phobic subjects were 


E 
i 
tested individually, one of whom complained 
Responses to pictures of cats in the first sub- 
f 
p 


of a fear of cats and the other of spiders: 
ject were found to be considerably greater 
than responses to 11 other animal pictures 
on four successive trials, while a compariso? 
of alternating Spider pictures and landscape 
in the second patient showed a significantly 
greater response to Spiders, which habituate 
to the point of no difference over four sessions: 
In the second study, 10 students who h^ 
expressed a fear of Spiders were contraste 
With 10 control subjects without such fear? 
Sight pictures of spiders were projected M 
alternation with eight landscapes while p. 
GSR was measured as before. Use of the e 
of response to spider pictures, divided by HT 
Tesponse to landscapes, gave perfect separati? 
between phobic and nonphobic subjects. on 
again, however, a clear tendency was appa 


X 
X 
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for responses following phobic slides to ha- 
bituate, although this took longer to com- 
plete than did responses to neutral pictures. 
As in the study by Geer, no additional mea- 
sures were used that would allow conclusions 
as to the subjective state experienced by the 
subjects, or even generalization to other physi- 
ological variables. In the absence of self-report 
data, it would be obviously fallacious to as- 
sume that the large GSRs to phobic pictures 
reflected phobic anxiety; they may equally re- 
flect more careful attention paid to them by 
subjects having a special interest (Flanagan, 
1967). 

The stimuli most commonly associated with 

desensitization are produced in fantasy, as the 
use of imagery makes up in flexibility what it 
lacks in objectivity and accessibility to pre- 
Clse measurement. 
_ Avery early study, whi 
issue, was carried out well before the use of 
desensitization was established (Rowland, 
1936). Four normal subjects, capable of a 
deep hypnotic trance, were given 12 scenes to 
visualize, which varied in their “exciting” 
quality. Although the use of few subjects and 
Primitive physiological recording techniques 
limit generalization, the study was carefully 
done, and the conclusions may well be consid- 
ered useful. Pulse rate, respiration, and skin 
resistance were all found to relate to the pre- 
Judged scene intensity, while repeated presen- 
tations of the scenes led to à diminution 1n 
these responses. 

In McAllister's study 
Phobic situations drawn at random from each 
Datient's hierarchy, preserved in hierarchal 
Order, were interspaced with neutral scenes 
and presented for imagination while skin re- 
sistance was recorded. On initial presentation, 
five out of eight patients responded signifi- 
cantly more to the phobic than to the neutral 
Stimuli, 

The effect of imagining 


a . n 
dun been studied using ple E 
Neasurement of forearm muscle blood flov 


(Gelder & Mathews, 1968). Three imaginary 
Situations were found for each of 19 phobic 
Patients, a neutral scene and two phobic 
Scenes, one of which was taken from the bot- 
9m of a desensitization hierarchy and one 
Tom the top. The patients were asked to 


ch is relevant to this 


(see Footnote 3), 10 


phobic stimuli has 
thysmographic 


imagine each scene in turn and to rate immedi- 
ately afterwards any subjective anxiety experi- 
enced during the image. Mean blood flow was 
raised during the imagination of both phobic 
scenes, although only during the higher phobic 
scene was the level reached significantly 
greater than that during neutral scenes. 

A more thoroughly controlled study was re- 
ported by Grossberg and Wilson (1968), in 
which forehead EMG, heart rate, and skin 
resistance were recorded as subjects imagined 
fearful and neutral scenes. Substantial in- 
creases in heart rate and skin conductance 
were observed during the reading of both 
phobic and neutral scenes, but significant dif- 
ferences between scenes in heart rate and skin 
conductance responses were confined to the 
imagination period. However, by reanalyzing 
in terms of the frequency of responses, irre- 
spective of amount, reading of fear scenes was 
found to produce significantly more skin con- 
ductance increases, and imagining produced 
significantly more heart rate increases. Repe- 
tition of these scenes showed a systematic 
decline in responses, closely resembling a 
typical extinction or habituation curve. Elec- 
tromyogram data did not show the regularity 
apparent in the autonomic variables, and no 
significant differences were found between the 
scenes in terms of muscle activity. 

A further group of control subjects, not 
reporting any of the relevant fears, was 
matched with experimental group subjects and 
given identical instructions. Only heart rate 
response during the reading of fearful scenes 
showed a significantly greater effect in experi- 
mental than in control subjects, while the 


difference in heart rate and skin conductance 
response found earlier between fearful s 
neutral scenes was still present in the cni 
subjects. Since the two scenes should not have 


been emotionally different for control M 
this result is against prediction anc 
hat in the presentation of the fearful 
esence of some cues had a rela- 
tively greater effect even in nonphobic M 
Finally; correlations were calcu ate 

between physiological responses and mee 
estimates of vividness. Only one comely 
was significant, that between kenty rate re- 
and vividness (x = .60). 


jects, 
suggests t 
scenes the pr 


jects. 


sponse to imagery 
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It can reasonably be concluded that pre- 
dictable autonomic responses follow the imagi- 
nation of fearful Scenes, such as those used in 
desensitization, and that these responses de- 
cline systematically with repetition, even in 
the absence of relaxation training. The per- 
sistence of scene differences in control sub- 
jects, however, Suggests strongly that voice 
cues, subject expectancies, or some other un- 
known factors are involved in producing these 
differences. 

The effect of imagining fearful and neutral 
scenes on respiration changes in 55 students 
was reported by Rimm and Bottrell (1969), 
As in the previous experiments cited, signifi- 
cant differences emerged in physiological re- 
; increase in res- 
as significantly 
nes. No signifi- 
ound when each 


relation of .3 (p 
this rating and 
respiratory response to the fearful scene. 

he nature of the interaction between the 
effects of phobic imagery and of relaxation 
training has been further studied by P; 


as not disturbing (neutral) and as disturbing 
(fearful) from the Wolpe-Lang Schedule, in 
that order, 

After the training sessions had been com- 
pleted, all subjects were again asked to imag- 
ine the fearful scene, first for 5 seconds, then 
for 10, and finally for 20 seconds, only the 
last image being scored. As before, forearm 
EMG, heart rate, respiration rate, and skin 
resistance were measured, but analysis was 
confined to those particular measures in which 
an individual had shown signs of responding 
on initial presentation of the fearful scene. 
Both the muscular relaxation group and the 
hypnosis group showed a significantly greater 
decline in physiological response than the 
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control group, and the extent of this deci 
was found to be related to the average redu 
tion in physiological activity during relaxa: 
tion (r = .46). Its 
In addition, and contrary to the resu ) 
found by Grossberg and Wilson, Paul (19589 
reported that control (untrained) subjects 4 
tually showed an increase in composite ue 
sponse to the fearful image on the final Pa 
entation. This result raises the possibility t e 
some unknown procedural difference we 
the methods used by Paul (19692) and o 
berg and Wilson (1968) seriously influenc is 
the experimental results. One possibility a 
that Paul’s control group, unlike his other ud 
groups, had not habituated to perpe 
from the therapist during training, and E 
well have overreacted to the imagination e: 
Structions following a prolonged period, ai 
Silence. Despite this, it may at least be € in 
cluded that repetition of fearful imagery Fy iN 
the relaxed subject leads to a diminution N. 
autonomic response, although it remains be 
clear whether relaxation might produce ja 
immediate reduction in the response to d 
image presentation, or whether relaxation "e 
associated with a more rapid rate of decli! 
in responding, the 
One of the most extensive studies of bee 
Physiological effects of phobic imagery 
recently been completed by Lang, Melame 
and Hart (1970). In the course of a con 
parison between automated and live desni 
tization, which showed the automated trea 
ment to be at least as effective as conve” 
tional methods, heart rate, respiration rates 
forearm EMG, and skin resistance were mea- 
Sured throughout treatment, Electromyogra™ 
activity was generally found to be minima 


a marked decline thi. 
à > this was iously 
monotonic. "5 less obwiois 
The most Strikin 


s g findi 
correlations between 
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| bs ET quom measure of the behavioral 
of treatm eported reductions in fear at the end 
MM Sedi Subjects who showed the highest 
sponded e during a signaled item also re- 
05) tthe to treatment (r= .15; p< 
in this ^ the correlation between the decline 
tations ( cu rate response to repeated presen- 
even high abituation) and good outcome was 
heart rig! (r= 91; PS 01). The rate of 
Ment (e : ecline at other points in the treat- 
Srna within relaxation sessions) did not 
ductane with outcome, nor did skin con- 
, Telate E or respiration. responses appear to 
ee treatment effectiveness. , 
that a ger from this first experiment 
CUN M. who are more responsive auto- 
phobic d as assessed by heart rate during 
Change scenes, and in whom this physiological 
ety " is synchronous with self-reported anxi- 
A Naa og both a more clear-cut decline in 
ante response and subsequent better 
jects iy response to treatment. The sub- 
gave oe responded poorly to the treatment 
signals ges a similar number of anxiety 
physiolo ie these were not synchronous with 
Th gical response. 
scenes pun experiment, responses to phobic 
Speakin E students having à fear of public 
dents m were contrasted with those in stu- 
lecteq ina a fear of spiders. Five prese- 
dom o hierarchy items were presented in ran- 
both rder and interspaced with neutral items, 
Dia subjective anxiety and vividness of 
ately at being rated on à 0-4 scale immedi- 
ee each item. Heart rate consistently 
Sente cw as higher hierarchy items were pre- 
consist and similar although somewhat less 
ge results were found for skin resist- 
physi res respiration. Correlations between 
ety 9 ogical response and self-report of anxi- 
eg vividness tended to be maximal when 
atio rate was used, although a high corre- 
Tespo was obtained between skin resistance 
A nse and reported vividness. 
been Som pie but well-controlled study has 
Collea, ecently reported by Van Egeren and his 
Which ine (Van Egeren et al., in press), in 
of imagined items spaced over a hierarchy 
Ih ean speaking situations were presented 
^ alanced order to 15 subjects trained in 
elaxation and 15 untrained subjects. Heart 
Tate, respiration "3t skin resistance, and 
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digital pulse amplitude were recorded through- 
out, and all of these measures significantly 
differentiated between phobic and neutral 
items. The effects of item repetition, and 
the interaction with relaxation and hierarchy 


position were unexpected, and in some re- 
lier work. Only 


spects inconsistent with ear 
digital pulse amplitude response consistently 
declined with item repetition, regardless of 
training in relaxation, and this decline did not 
show any tendency to recover between ses- 
sions (cf. Agras, 1965; Grossberg & Wilson, 
1968). However, there was an overall tendency 
for skin resistance responses to all items to be 
less marked in the relaxation group. Items 
higher in the hierarchy (i.e, more intense) 
elicited significantly greater skin resistance 
and digital pulse amplitude responses, but 
contrary to expectation the rate of response 
decrement was greater for these higher items 
than for those low in the hierarchy. Again, 
contrary to expectation, where relaxation 
training had significant effects, they were most 
marked for the scenes of highest intensity. 


While the results suggest that the response 


decrement to imaginal items is not the pre- 
dicted simple function of repetitions, relaxa- 
tion training, and hierarchy position, some 


serious reservations must be made before 


making a direct comparison with desensiti- 


zation treatment. For example, items were not 
given in ascending order, and the same item 
was not repeated consecutively until anxiety 
was reduced. In fact there was no significant 
decline in anxiety ratings throughout the ex- 
periment (two sessions), nor were there any 
significant differences in autonomic resting 
levels due to relaxation, suggesting that the 
experimental procedure was sufficiently dif- 
ferent from desensitization treatment to se- 
verely limit the generality of any conclusions 
drawn. : 

A considerable amount of evidence has been 
summarized in this section, and the conclu- 
sions to be derived vary from the fairly cer- 
tain to the wholly hypothetical. In study after 
study, phobic stimuli have been shown to 
evoke significantly greater autonomic re- 
sponses than neutral stimuli. It remains un- 
clear, however, to what extent these auto- 
nomic responses are entirely attributable to 


phobic anxiety, and to what extent factors 
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such as expectation or cues given by the ther- 
apist may also be involved. Johnson and 
Campos (1967) have implicated the subject’s 
knowledge that he is required to make a sub- 
sequent verbal report on his experience during 
imagery, as a critical factor in determining 
the extent of heart rate responses to neutral 
images. However, the finding of significant 
relationships between heart rate and anxiety 
or vividness in two independent studies goes 
at least some way to support the former ex- 
planation, and is consistent with Wolpe’s ac- 
count of the mechanism of desensitization. 
Both heart rate and other cardiovascular 
measures, forearm blood flow and pulse amp- 
litude, have been found to show increased re- 
sponses as items higher in the hierarchy are 
presented. Heart rate may be a particularly 
responsive measure during phobic imagery, as 
opposed to real-life visual stimuli, because of 
the tendency described by Lacey (1967) for 
heart rate to decelerate during orientation 
reactions to external stimuli and to accelerate 
during mental tasks when stimuli from the 
external evironment are ignored. 

When phobic scenes are described, prior to 
imagination, skin resistance responses may be 
more frequent than heart rate responses 
(Grossberg & Wilson, 1968), and when pic- 
torial stimuli are used, the GSR consistently 
differentiates between phobic and neutral stim- 
uli. It seems, therefore, that while autonomic 
Tesponses reliably discriminate phobic from 
neutral stimuli, the particular measure most 
appropriate may depend on the type of stimuli 
presented. 

Some researchers, with the notable excep- 
tion of Paul (19692), have reported that 
autonomic responses to phobic stimuli show a 
marked tendency to diminish with repetition, 
whether or not the subjects have been trained 
in relaxation, Paul's study was the only one in 
Which responsiveness appeared to increase 
with repetition, although the usual reduction 
occurred in subjects trained in muscular or 
hypnotic relaxation, The findings of Van 
Egeren's experiment, although difficult to in- 
terpret, suggest that the presence and extent 
of such a decrement is 4 complex function of 
the particular Physiological measure used, 
interacting with other interpolated stimuli, the 


presence of relaxation, and hierarchy level of 


ANDREW M. MATHEWS 


the items presented. However, as was pointed l 


out earlier, the procedure involved in this ex- 
periment differed in many respects from de- 
sensitization treatment. 

Lang's intriguing results provide evidence 
that the subject's facility in obtaining imagery 
vivid enough to elicit clear autonomic re- 


sponses synchronous with the verbal report | 


of anxiety, as well as the subsequent diminu- 
tion of such responses, are critical factors 
associated with the successful outcome of 
treatment, It is, of course, possible to argue 
that this finding reflects only the incidental 
side effects of a cognitive change occurring 
in desensitization; however, the stronger hy- 
pothesis would seem to be that the presence of 
autonomic responses to imagery, the decre- 
ment of such responses with repetition, and 
possibly the subject's awareness of these 
changes may all be essential parts of the de- 
sensitization process. 


ANALOGUES OF DESENSITIZATION 


Studies included in this section are all pri- 
marily concerned with investigating or dem- 
onstrating mechanisms which may underlie 
desensitization, and all arrive at different 
models of the treatment process, based on 
experimental analogues. The best known theo- 
retical account of desensitization, that of re- 
ciprocal inhibition, or counterconditioning 
(Wolpe, 1958), has been demonstrated in 
analogue form by Grings and Uno (1968). 
The effects of visual cues that had been as- 
sociated with muscle relaxation and electric 
shock were compared singly and in combina- 
tion. Both skin resistance and pulse volume 
responses tended to be greatest to the shock 
cue alone, were slightly reduced to a combi- 
nation of shock and neutral cues, and were 
least to the shock and relaxation cue combina- 
tion. , 

These results suggest strongly that condi- 
tioned autonomic fear responses and learned 
relaxation. can reciprocally inhibit one an- 
other and thus provide a useful experimenta 
situation in which variables relevant to de- 
sensitization may be manipulated. Howeve"? 
further work would be required before it coul 
be concluded that such counterconditionin? 
actually occurs in the treatment situation, ^ 
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E nis process is a necessary part of suc- 
b esensitization. 
model em to the counterconditioning 
md eed een recently proposed by Lader 
findings ge (1968) based in part on the 
Tate to si a relationship between habituation 
sensitiz ee stimuli and the outcome of de- 
In this let (Lader, Gelder, & Marks, 1967). 
Bd n atter study phobic patients were 
Sess ES BM desensitization in order to as- 
Peated e of habituation of the GSR to re- 
Ei iy stimuli, Behavioral signs of 
Were si Ims during pretreatment testing 
ahituatio icantly associated with delay in 
of skin on (r = .36) and with the frequency 
linical Vendee oe fluctuations (r= .61). 
cao of improvement after de- 
treatmer ion were also correlated with pre- 
m he physiological measures to 2 signifi- 
Tate a i 49 in the case of habituation 
tance fh d 42 in the case of skin conduc- 
B snis Loos Although these results are 
Ship etia in terms of the expected relation- 
a poor ween high levels of anxiety and both 
nomic eine to desensitization and auto- 
More i re it is possible to postulate a 
Sensitiz; ue link between habituation and de- 
Giese Lader et al. have proposed that 
Similay ization may be viewed as a process 
nder to that of habituation taking place 
Es a mal conditions, that 1s, while the 
Consist, remains at the lowest level of arousal 
ent with consciousness. 


5 in the previously cited study these re- 


Sults 
ened only be considered suggestive an 

Phy: lot provide direct evidence 25 to the 

i in desensiti- 


Sio : B 
logical processes occurring l 
more direct evi- 


10 
n. Attempts at providing 
f these responses 


en 

BÉ cain the nature 0 1 

anq a made by Van Egeren (in press) 

Egerg, elamed (cited by Lang, 1970). Van 
en concluded, on the basis of the studies 


Drey; 

. vio a i n 

tio Pine discussed, that while any diminu- 
n responses to imaginal stimuli is not 


n 

effec 

By pie by relaxation in the way predicted 

Dara, Unterconditioning, neither are the 
decrement 


r 
ideny tes of such a response 

ical with those found in the case of sim- 
Wo Physical stimuli (habituation). This 
Sion {Pear to be a very reasonable conclu- 
Studie ut requires confirmation by further 
S of the response decrement occurring 
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when imaginal scenes are repeated without 
interpolated stimuli conceivably having dis- 
ruptive effects. » 
Melamed recorded skin resistance responses 
in snake phobic students to both a series of 
simple auditory stimuli and a series of films 
showing snakes. Subjects were divided into 
three groups on the basis of preliminary in- 
structions to simply attend to the films, to 
experience the full force of their emotional 
or to utilize training in relaxation. 
“attend” and “relaxation” groups 
e decrement to the phobic 
elated highly with habitua- 
stimuli, and significant 


tion rate to auditory 
correlations were found between this habitua- 
ire measure of fear 


tion rate and a questionnai 
change, consistent with the results obtained 
with patients by Lader et al. (1967). Curi- 


ously, however, correlations between auto- 


nomic response decrement to the films and 
fear did not reach a 


reported reduction in 
significant level, so that these results cannot 


be considered as unqualified support for the 
habituation model of desensitization. 

Folkins, Lawson, Opton, and Lazarus 
(1968) have studied methods of reducing the 
effects of viewing a stressful film as an ana- 
logue of desensitization. The effects of prior 
relaxation, cognitive rehearsal with stressful 
imagery, and both procedures together were 
contrasted with no treatment in terms of their 
efficacy in reducing heart rate and skin con- 
ductance responses to a film portraying à fatal 
accident. Contrary to prediction, subjects who 


had received cognitive rehearsal only; that is, 
reported 


response, 
Both the 
showed a respons 
stimuli which corr 


less anxiety 
kin condu 

ful accident scen 
= scis the cognitive rehearsal group had 
a less marked peak than any other group. 
Heart rate responses did not follow the aan 
pattern, and no significant differences d 
found among the treatment groups for t 2 
variable. Correlations between psychologica 


and physiological measures of stress were gen- 


i ing between 
low, the highest being that e 
ee a d stress and peak heart rate during 
= 32, p < 01) 

d by Folk- 
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of desensitization in this experiment is that 
the tape-recorded imagery instructions pro- 
ceeded rapidly and inflexibly to stressful 
scenes, and may thus have disrupted relaxa- 
tion. Some weight can be attached to this sug- 
gestion, as a greater skin conductance response 
occurred to imagined stressful scenes in the 
desensitization group than in the cognitive 
rehearsal group. Thus the addition of relaxa- 
tion instructions appears to have been associ- 
ated with an increased autonomic response to 
imagery, contrary to reciprocal inhibition 
theory. Folkins considered that his results 
provide support for the role of cognitive 
processes in desensitization, and cited the 
criticisms of Stimulus-response theorists made 
by Breger and McGaugh (1965). However, 
the considerable differences between this ex- 
periment and treatment practice, such as the 
use of nonphobic subjects and the anticipation 
of a stressful film in place of the usual phobic 
imagery, must limit the degree to which gen- 
eralizations can be made to desensitization. 
Although not employing physiological mea- 
Sures, consideration of an experiment by 
Valins and Ray (1967) may be relevant. In 
this study, snake phobic subjects listened to 
a recording which they believed to be of their 
Own heart sounds and were thus persuaded 
that their heart rate decreased as they watched 


slides of snakes. In contrast to a control 
these subjects show; 


in snake avoidance 
pretated by the auth 
tant cognitive factor 


group, 
ed a significant reduction 
behavior, a result inter- 
ors as evidence of impor- 
s in desensitization, While 
it seems likely that the subjects awareness of 
reductions in autonomic response to phobic 
stimuli may have an important role to play in 
desensitization, Valins and Ray's experiment 
15 not relevant to the postulate that real auto- 
nomic changes are necessary in order that de- 
Sensitization should be maximally effective. In 
this connection it is perhaps a pity that this 
study did not in fact include heart rate re- 
cordings in order to examine the possibility 
that false feedback may have induced real 
heart rate changes in the same direction. 
Perhaps the only conclusion that can be 
derived from these Studies is that advocates 
of counterconditioning, habituation, and cog- 
nitive models of desensitization can all point to 
some experimental support from studies utiliz- 


ing psychophysiological measurement. AD 
these models may be heuristically useful, bu 


their relative value can be provisionally as- | 


sessed by a consideration of their ability to 
account for the results described earlier. 


Discussion 


The evidence presented in the previous sec 
tions strongly supports the idea that psycho- 
physiological measurement is of potential E 
in studying both the process and outcome 0 
desensitization. : 

In an earlier review of psychophysiological 
approaches to psychotherapy, Lacey ( 1959) 
came to a similar conclusion as regards the 
study of psychotherapeutic process, but Me 
less optimistic as regards the assessment 0 
outcome. It may well be that one of the most 
important aspects of behavior therapy is that 
by defining more precisely the target behav- 
iors and the procedures involved in treatment; 
objective measurement of autonomic and be- 
havioral events becomes both more possible 
and more useful. Examination of physiological 
responses during desensitization has alreadY 
led to the confirmation of some predictions 
regarding the processes involved, and the dis- 
confirmation of others. The choice of appro- 
priate physiological measures, the nature O 
the response to phobic imagery, the interac 
tion of this response with relaxation, the rela- 
tionship between changes in the different com- 
ponents of fear, and the theoretical mecha- 
nisms underlying desensitization are each con- 
sidered in the light of the available evidence: 


Choice of Physiological Measure 


Although useful results have been obtained 
using respiration and EMG measures, the 
most consistently significant effects have been 
Observed using skin resistance or cardiovaS 
cular responses, particularly heart rate. Lacey 
(1967) and Graham and Clifton (1966) hav? 
Suggested that heart rate deceleration E 
companies the intake of information from t” 
environment, while concentration on intern? 
mental events, or rejection of the enviro" 
ment, leads to heart rate acceleration. en 
dence consistent with this hypothesis has p 
reported by Craig (1968) who found a 
imagining a painful stimulus produced ; 
much heart rate increase as the actual sti™ 
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bean he the vicarious stimulation of 
B 5 ellow subject experiencing pain 
EC accompanied by any such increase in 
oth rate. Skin resistance response, on the 
er hand, was more sensitive to the vicari- 
o» than to the imaginal stimulus. It might 
m erefore be expected that heart rate should 
54 a relatively sensitive indicator of response 
he phobic images, while skin resistance might 
e more responsive to externally presented 
phobic stimuli. 
,Crosbers and Wilson (1968) found evi- 
rd that skin resistance responses were 
Bent requent during scene descriptions, while 
in cbe responses were more frequent dur- 
E ne actual image. Estimated vividness of 
thee a was significantly correlated with 
rate, but not with skin resistance. 
Lang et al. (1970) found a similar relation- 
Ship between vividness and both heart rate 
m skin resistance response, although only 
Sedi Kate correlated significantly with sub- 
p anxiety. In this latter experiment heart 
Fa response was also more closely related to 
archy level than with the other measures 
cai Skin resistance responses have not al- 
mee been successful in the consistent sepa- 
Me phobic from neutral images (e.g. 
ister, 1966, see Footnote 3), although 
When phobic stimuli are presented visually 
(Geer, 1966: Wilson 1966, 1967) the GSR 
vas a highly effective discriminator. The well 
nown tendency of the GSR to habituation, 
Nowever, rapidly reduced this discriminative 
ial as the stimuli was repeated. The choice 
the most appropriate measure would there- 
Ore seem to depend on the nature of the stim- 
uli used, as well as on individual response 


Specific; : 
Decificity, However, 


weSponse to phobic imagery 
lent of outcome (e.g, Paul, 
bs addition of at least sk 3 s 
» uld seem advisable on general grounds, an 
s coially when real phobic stimuli are pre- 
B hted (Barlow et al., 1969) or as a measure 
logos es in relaxation (Mathews & Gelder, 
9). 


1966). 
in resistance 


E 
Bects of Phobic Imagery 


paie of the most consiste 
erge from this review is 


nt findings tO 
that imagine 


phobic scenes as used in desensitization evoke 
larger autonomic responses than scenes judged 
to be neutral, and that with repetition these 
responses show a consistent decrement. Wolpe 
(1958) has commented that under certain cir- 
cumstances, anxiety does not dissipate between 
images but appears to summate. This clinical 
observation has not generally been supported 
by psychophysiological evidence, with the 
exception of one experiment by Paul (19692). 
Such a response increment would thus seem 
to be the exception rather than the rule, and 
its occurrence may depend on relatively high 
anxiety in the subject, or on the frequent 
application of high intensity stimulation. 
These conditions have been described as those 
which disrupt the usual process of habitua- 
tion to repeated stimuli and may lead to an 
upward spiral of responses into panic (Lader 
& Mathews, 1968, 1970). They are also the 
same conditions prevailing in flooding ses- 
sions, suggesting that if continued for long 
periods this upward spiral must be self-limit- 
ing and in the end will also lead to decrement 
in autonomic anxiety responses (Boulougouris 
et al., in press). There is thus tentative evi- 
dence of two related procedures having a simi- 
lar result; presentation of mild phobic images 
interspaced with a rest or relaxation period, or 
the presentation of intense phobic images un- 
der conditions of prolonged massed practice, 
both leading to à marked decrement of auto- 
nomic response to imagined phobic stimuli. 


Effects of Relaxation 
The evidence concerning the physiological 
effects of relaxation training suggests the 


possibility of transient reductions 1n cardio- 
and muscular activity, 


vascular. respiratory; : 
: described by Jacobson 


llowing exercises / 
a j sions following longer 


. Relaxation ses 
be nd of t ever, may be accompa- 


raining, how re qe 
i 8 n ve 
ied by evidence of a generally lowered Te 
of der (Mathews & Gelder, 1969). De- 
spite this, nO direct evidence has been found 
for one of the centra 


] postulates of reciprocal 
inhibition theory, th 


at relaxation reduces or 
tonomic anxiety responses as- 
prevents the autonom 5 
P ciated with phobic imagery. On the basis of 
Paul’s (1969a) experiment, it may be tenta- 
tively concluded that a respon 


se to a phobic 
image, at least by the fourth presentation, is 
s 


= 
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less in the relaxed than the nonrelaxed sub- 
ject, without it being clear whether immediate 
inhibition, an accelerated 


decrement, or both, are involved. The effects 
of relaxation may 


reported by Wolpe and Flood (1970). 
In the Folkins et al. (1968) analog 


ce that a relatively large 
to phobic images, fol- 
decay, is associated with a 


autonomic response 
lowed by a rapid 
good treatment ou 


work of Antro} 


ase the vividness of 
imagery experienced during desensitization. 
Correlations between subjective estimates of 
vividness and the degree of autonomic re- 
sponse (Grossberg & Wilson, 1968; Lang et 
al., 1970; Rimm & Bottrell, 1969) further sug- 
gest that more marked physiological respon- 
Siveness to Phobic imagery may thus be pro- 
duced. Hence, the evidence that has been re- 
viewed gives rise to the hypothesis that re- 
laxation may augment both vividness and the 
autonomic effects of imagery, while at the 
same time maximizing the response decrement 
with repeated Presentations. 


Relationship between the Components of Fear 


Desensitization (and flooding procedures) 
have been found to have a relatively powerful 
effect in red ucing autonomic anxiety responses. 
However, the dissociation between physio- 


logical, behavioral, and subjective changes 
noted in case studies of desensitization has 
been confirmed in controlled trials le 
1965; Paul, 1966). Despite this, it seems 
extremely unlikely that no causal relationship 
exists between autonomic responding MK. 
desensitization and subsequent -— 
change. Indeed, Lang's most recent "o 
(1970) suggest Strongly that marked au 7 
nomic changes during treatment, agri 
with reported anxiety, are an essential er 
of successful desensitization. An sao. 
explanation for the reported dissrepanc 
that causal relationships may be obscured » 
a tendency for the separate components ee 
fear to change in differing amounts at d! 
ferent times (e.g, Lang & Lazovik, 1e 
Within imaginal desensitization, Kabig 
tion of the GSR to phobic images may phr 
cede the elimination of anxiety signals CA 
tenberg et al., 1969), and although this e 
may be reversed in some individuals (Agr M 
1967), both of these usually precede e 
actual phobic behavior, Barlow et al. (196 13 
have confirmed uncontrolled observan 
(Hoenig & Reed, 1966) that following pem 
nal desensitization, there may be emi 
little reduction in GSR to the real pho ee 
object, although the response to iman 
scenes had greatly decreased. peciit 
change in this group was also small iompar 
to a group who had been repeatedly Epos K 
to the real phobic object, and this latter Hes 
showed a very marked decrement in GSR, 3 
further presentations of phobic objects; ee 
deed, the mean response after treatment was 
less to the real than to the imagined Te 
This important finding suggests a considerab 
degree of stimulus specificity in response dec 
Fement, and argues against the idea that elimi- 
nation of autonomic responses to imagina 
es to any great extent to the 


change resulting from dese 
with a progressive reduction 
sponse to imagery, Associate 
by, decreasing reports of 
Behavioral change having the 
closer contact with the rea 


may then occur, followeq b. 
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bic situation 
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Spon; : 
nal iy (iene to the real-life stimulus and 
0 the an uction in self-ratings of anxiety 
Ween these B object or situation. Overlap be- 
lent, and ns rocesses must occur to some ex- 
fi fosyneratic may be a considerable degree 
‘atic response ordering in some pa- 
1S further postulated that such 
jm aifend are responsible in 
“ved betw extremely low correlations ob- 
change in “eee the extent of posttreatment 
“uring or cue different systems, since testing 
fin " shortly after treatment is likely to 
Jects at different stages of progress. 


Theoretical 


Mechani Jui 
Csensitizat nisms Underlying 


1 ion 
Was 
e iced S the tentative conclusions that 
red that ged from this review, it i5 consid- 
Section jn zm analogue described in the last 
‘Count of fact represents an oversimplified ac- 
Was d total desensitization process. It 
May be ler suggested that phobic imagery 
tronger made more vivid and may evoke 
Subject initial autonomic responses when the 
a 1S In a relaxed state. Should this be 
it must. clearly modify reciprocal 
án ARE EDEN to the extent of assigning 
le to rela lal facilitating and later inhibiting 
Consiq axation. If the habituation analogue 
th as ered, then a possible explanation for 
hi iag nation of vivid imagery and large 
Sugge t omic response with good outcome 
3 ount itself, According to Sokolov’s (1963) 
Benerated | habituation, a neural model is 
i? Which by application of a repeated stimu- 
"comin h then acts to inhibit responses to 
Mode] g stimuli which match the internal 
In this way, observable responses 


Stadu. 
u H H 
ally cease as the neural model is built 


Up 

> Unless 

of ess the organism learns that occurrence 
conse- 


ence Stimulus leads to significant 
Cceur S. If a related process can be said to 
imaginal desensitization, then it 


rly be necessary for vivid imagery 


Ou] 
5 d clea 
ise construction of 


be 

an in experienced, otherw 
Not men model would be delayed or might 
SU € place at all. On the other hand, Lader 
Mzio (1967) have presented evidence that 
taneg S Patients with a high level of spon- 
tr m Us autonomic activity habituated ex- 
Ted slowly, and Lader and Mathews 
Wa 2) suggested that relaxation may be one 
Of counteracting this elec Considering 
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desensitization as analogous to habituation 
would therefore lead to the hypothesis that a 
state of low arousal, for instance, relaxation 
produces two useful effects; it facilitates the 
occurrence of a vivid image and thus a better 
internal model, while at the same time it 
maximizes the rate of response decrement to 
that image. 

The failure of generalization across systems 
or from imaginal to real stimuli presents 
further problems, however, There are several 
processes which may be involved in extending 
the effects of desensitization from decrement 
in response to imaginal scenes to a reduction 
in phobic avoidance behavior. Intuitively it 
seems likely that phobic avoidance is partially 
maintained by the anxiety associated with 
anticipation and mental rehearsal, and that 
following desensitization this anticipatory 
anxiety is reduced. Indeed, the pre- and post- 
treatment tests used by Paul (1966) to assess 
autonomic changes took place immediately 
before subjects were required to speak in 
public, so that there is some evidence of im- 
mediate posttreatment physiological change 
during anticipation of contact with a phobic 
situation. In this study, autonomic measure- 
ment was not continued throughout contact 
with the real-life situation, but the postulate 


put forward here would lead to the predic- 
tion that under these circumstances the effect 
of the imaginal desensitization would have 
been less marked, as was found by Barlow et 

creased autonomic 


al. (1969). Despite this, de 
and lessened verbalization of 


tion may well lead to 
of approach behavior 


responsiveness 
anxiety during anticipa 
an increased likelihood 


in the phobic situation. ` ; 
Awareness of reduction in autonomic re- 


sponses may lead to therapeutic attitude 
change as theorized by _Valins and Ray 
(1967), while the expectations of both thera- 
pist and patient may provide a further moti- 
vating influence (Leitenberg et al., 1969). 
Once increasing contacts with a phobic situa- 
tion are made in the absence of anticipatory 
anxiety, 4 decrement in autonomic responding 
to real phobic stimuli is likely to occur, lead- 
ing finally to self-reports of no anxiety in real 


life. E 
To summarize, an attempt to encompass as 


vidence as possible within a relatively 


much e Ea 
t of desensitization leads to the 


simple accoun 
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postulate that the initial effect is one of re- 
sponse decrement in the autonomic and self- 
report components of anxiety during imagery, 
whether this is described as reciprocal inhibi- 
tion, extinction, or habituation. Possibly aided 
by several different cognitive processes, this 
decrement leads to increased exposure to real 
phobic situations, and thus enables a similar 
decrement of autonomic and verbal responses 
to occur in real life. 
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generally 
focusing 


; and the task 
(d) research examining the 

suggests that anxiety reduces 
(e) "worry," an attentionally 
ing of task performance than 
implications 
discussed. 


Highly test-anxious persons typically per- 
form more poorly on tests than do low-test- 
anxious persons, particularly when the tests 
are administered under Stressful, evaluative 
conditions, The literature reviewed in this 
article suggests that this performance differ- 
ence is largely due to a difference in the 
attentional focuses of high- and low-test- 
anxious persons during task performance. 
The low-test-anxious person is focused on 
task-relevant variables while performing tasks. 
The highly test-anxious subject is internally 
focused on self-evaluative, self-deprecatory 
thinking, and perception of his autonomic re- 
sponses. Since the difficult tasks on which the 
test-anxious person does poorly require full 
attention for adequate performance, he cannot 
perform adequately while dividing his atten- 
tion between internal cues and task cues. 

1 This review 
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fulfillment of th 


is based primarily on a dissertation 
€ University of Waterloo in partial 
e requirements for a doctoral degree. 

?'The author Wishes to thank the members of her 
dissertation committee, Marvin Brown, Chairman, 
Donald Meichenbaum, and Herbert Lefcourt for their 
helpful suggestions and advice. Particular gratitude 
is expressed to my husband John J. Wine, for 
suggesting the basic theoretica] approach. ^ 

Requests for reprints should be sent to Jeri pos 
Department of Psychology, University of Waterloo, 
Waterloo, Ontario, Canada. 


test-anxious Persons are 
conditions in 


relationship between anxiety and task variables 
demanding cognitive activity, is more debilitat- 


is autonomic arousal. Treatment and research 
of this attentional interpretation of test 


N 1 


anxious person who focuses 
as supported by 
anxious persons are 
people low in anxiety; (b) the selí- 
activated in testing 
greatest performance 
tendencies of highly 
OW-anxious subjects; 


lized in performance; 


anxiety are briefly 


ed 
Marlett and Watson (1968) have stat 
this proposition rather well: 


is 
The high-test-anxious Person spends a part e 
task time doing things Which are not task OR oil 
He worries about his performance, worries bate. 
how well others might do, ruminates over cho ji 
open to him, and is often repetitive in his ei 
to solve the task. Any effort to overcome the o 
defeating kind of behavior which the hgh 
person exhibits should concentrate on allowing i 
to perform without the constraints which Tok 
the negative effects of the avoidance beh 
[p. 203]. 


;MENT 
Test ANXIETY THEORY AND MEASUREME 


The ideas presented above are restatements 
of test anxiety theory as advanced by T 
originators (Mandler & Sarason, 1952) an 
later contributors, Test anxiety theory was 
initially presented by Mandler and Sarason 
in their article introducing the Test Anxiety 
Questionnaire (TAQ). In contrast to genera | 
anxiety scales such as the Taylor Manifest | 
Anxiety Scale (MAS; Taylor, 1953), the TAQ 
Was constructed as a measure of anxiety 
proneness in a specific kind of stressful situa- 
tion—the testing situation, It was expected 
that items which related specifically to the 
subject’s reactions to testing situations would 
be more predictive than general anxiety scales 
of his behavior in these situations, 
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Tt w 
as assumed that two kinds of drives 


' are evoked ; 
ked in the testing situation. The first 


t 
Bei ue Wand task drives, which are 
Which end i responses or response sequences 
& Sarason e of the task [Mandler 
iS a learned m: 166]." The second kind 
two classes Persie drive, which can elicit 
Completion OF TESDODSESS those related to task 
those whicl ‘eich are anxiety reducing, and 
is hiter interfere with task completion. 
Was os sa class is the one which the 

nstructed to measure: 


These 

of inde ... may be manifested as feelings 

Action, ane helplessness, heightened somatic re- 

lr sane eons of punishment or loss of status 

Situation I and implicit attempts at leaving the test 
- It might be said that these responses are 


Self ya 
athe; 
her than task centered [p. 166]. 


are ones which 


Th 

e 

task-relevant responses 
arned 


Te d 
"aei the task, and must be le 
"esponse Sk performance. The self-oriented 
Which a are a class of generalized responses 
and eri evoked in a task situation, 
iy with the learning of specific 
responses. The TAQ, a 37-item 


Quest: 
Stionna; 7 
es naire in a rating-scale format, 1S 


ig 
5 E E: measure these self-oriented re- 
terna] he items refer to the kinds of 
the apia DOSES typically experienced by 
exami oject immediately before and during 

— and tests. 

in n (1958a) constructed a 21-item 
Test € of test anxiety, which he labeled the 
on jpg lety Scale (TAS). It is based largely 
Tapes taken from Mandler and S. Sarason's 
rewritten for a true-false format, The 


TA 

Produ and the TAS are highly correlated, 

soy CtMmoment correlation = -93 (I. Sara- 
Ne ds Sarason 


aj, cderson, & Nyman, 1968 

šis, e Stated the interfering response hypothe- 

tion ™phasizing that it is à habit interpreta- 
anxiety: 

t, states tha 

in the res 


low in anxiety differ 3 J 
hreatening 


Phi an 1 
Where activated by personally t 
Wit, “8S low scoring Ss may react to suc 
han Increased effort and attention to 
Orient igh scoring Ss respond to e: 203]. 


te 
ed, personalized responses [1960, P- 
focuses of 


Ag i 
hig arason noted, the attentional er when 
unge, 3nd. low-anxious subjects C... turn 

€r threat, The low-anxious subjects 


their attention to the task, while high-anxious 
subjects attend to their internal self-oriented 
responses. 

Alpert and Haber (1960) constructed a test 
anxiety questionnaire, the Achievement Anx- 
iety Test (AAT), which yields two measures. 
One of these is a 10-item debilitating anxiety 
scale, a construct similar to that measured 
by the TAQ and TAS. Total scores on the 
scale correlated .64 with total TAQ scores. 
The other measure is labeled facilitating anx- 
Mandler and S. Sarason (1952) had 
theorized that test anxiety is debilitating of 
performance only among subjects who had 
learned a habitual class of interfering re- 
sponses to the test anxiety. Among persons 
without this class of interfering responses, 
test anxiety elicits task-relevant responses 
leading to task completion. Persons scoring 
low on the TAQ were implicitly assumed by 
Mandler and Sarason to be high in the facili- 
tating kind of anxiety. Alpert and Haber’s 
AAT measures separately the two constructs 
of anxiety and their associated responses. The 
nine items on the facilitating anxiety scale 
are based on “a prototype of the item— 
‘Anxiety helps me to do better during exami- 
nations and tests [p. 213].’” The facilitating 
and debilitating scales correlated —.48 with 
each other (Alpert & Haber, 1960). 

Throughout this review, the terms “high 
test anxious” and “low test anxious” refer to 
persons who score at the high and low ex- 
tremes of measures of debilitating test anx- 
iety, such as the TAQ, TAS, or debilitating 
anxiety scale of the AAT. 

SELr-FOCUSING TENDENCIES OF HIGH- 

ANXIOUS SUBJECTS: EMPIRICAL 
EVIDENCE 

In a review of the literature on paper-and- 
pencil anxiety scales, I. Sarason (1960) cited 
a number of studies that provide evidence 
that high-anxious subjects are “more self- 
deprecatory, more self-preoccupied, and gen- 
erally less content with themselves than sub- 
jects lower in the distribution of anxiet 
scales [p. 404]." This conclusion was E 
marily based on the renis of studias wich 
iety scales and du. Minh SPOLES OF gue 
ality measures. In mis a iai aa Persone 

3 general, these investigations 


iety. 
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indicated that persons who score high on 
anxiety scales describe themselves in negative, 
self-devaluing terms on other pencil-and-paper 
measures as well. 

Two of the studies reviewed in this con- 
nection by I. Sarason reported relationships 
between test anxiety and responses of sub- 
jects in experimental situations. In one of 
these (Doris & S. Sarason, 1955) subjects 
differing in TAQ scores were arbitrarily failed 
on a number of tasks. Following the tasks, 
the subjects were required to rank order 
"blame statements," which included “self- 
blame" and “other-than-self” blame items. 
Highly anxious subjects blamed themselves 
for their failures significantly more than 
did low-anxious subjects. In the other 
study, Trapp and Kausler (1958) compared 
Wechsler-Bellevue digit-symbol performance 
and levels of aspiration of high- and low-test- 
anxious subjects. Through the actual perform- 
ance of the high- and low-TAQ groups did 
not differ, the levels of aspiration of high- 
anxious subjects became progressively lower 
over the four trials. On the last two trials, the 
high-test-anxious subjects had significantly 
lower level-of-aspiration scores than the low- 
anxious subjects. Though they did not ob- 
jectively perform more poorly than the low- 
TAQ subjects, the high-anxious subjects pro- 
gressively became more pessimistic about 
their performance over the four trials. 

A more recent study by Meunier and Rule 
(1967) reported results leading to similar 
conclusions regarding the test-anxious person's 
tendency to devalue his own performance, 
The effects of positive, negative, or no feed- 
back on the subjects’ confidence in their 
judgments of the length of lines were investi- 
gated. On no-feedback trials, highly test- 
anxious subjects rated their confidence level 
as low and comparable to their confidence on 
trials with negative feedback. In contrast, low- 
test-anxious subjects expressed high confidence 
in their judgment on no-feedback trials, and 
this level of confidence corresponded to the 
level expressed on positive feedback trials. 

Since publishing his review of anxiety scale 
studies, I. Sarason has completed several 
studies which further indicate the self-dep- 
recatory, self-ruminative tendencies of highly 
test-anxious subjects (Sarason & Ganzer, 


1962, 1963; Sarason & Koenig, 1965). In | 
each of these studies, an unstructured verbal 
conditioning paradigm was used with college 
students who scored at the extremes of the 
TAS. The subjects were required to describe 
themselves orally for approximately one-half 
hour. Nonreinforcement, reinforcement of 
negative self-references, and reinforcement of | 
positive self-references have been compared. 
Some of the conclusions from these experi- 
ments are: (a) Regardless of experimenta 
condition, highly test-anxious subjects gen 
erally describe themselves in more negative 
terms than do low-test-anxious subjects. (b) ‘ 
High-test-anxious subjects are extremely !€ 
sponsive to reinforcement when the response | 
class being reinforced is negative self-refe! 
ences. (c) However, when the response class 
being reinforced is positive self-reference 
high-anxious subjects do not condition. Th 
is, they do not produce more positive sel “4 
references as a result of being verbally rei!” 
forced for them. 

The conclusion that high-anxious perso"? 
are, indeed, generally self-deprecatory 4^ 
self-preoccupied seems inescapable in light 9 
the research evidence. But whether these self 
focusing tendencies are specifically activate 
in testing situations is a question which h4 
not been answered in the studies reviewed E 
this point. Four recent studies (Ganzer, 1968; 
Mandler & Watson, 1966; Marlett & Watso! 
1968; Neale & Katahn, 1968) provide eV 
dence bearing directly on this proposition. 

In the earliest of these studies, Mandle! 
and Watson (1966) administered a series A 
digit-symbol tasks to extreme low- and nigh- 
TAQ groups. A posttask questionnaire S 
cluded the question, “How often during t^ 
testing did you find yourself thinking ho* 
well, or how badly, you seemed to be dum. 
[p. 276].” On a 10-point rating scale, hi£- 
test-anxious subjects indicated marke®, 
greater occurrence of such thoughts than C 
the low-test-anxious group. In à pars. 
replication of this study, Neale and Kat? is 
(1968) reported identical 
questionnaire item, a 

Marlett and Watson (1968) reported 4 tj 
similar result, Ninth-grade males in high- ^4. | 
low-extreme groups on the children’s form” t 
the Alpert-Haber Achievement Anxiety 


results 0n 
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Eon Dember, & Stanford, 1963) were 
m» itrarily failed for 12 trials on a button- 
Elm A posttask questionnaire in- 
Bosh e question, “How often did you think 
The oe well or badly you were doing?” 
D Bes of the rating scale were labeled 
ede think about it," and “I thought 
b2-20: id much I couldn't concentrate [pp. 
Fsubject |. „The scores of high-test-anxious 
Lire ii indicated that they were significantly 
“a à isturbed by such self-focused thoughts. 
tion e more direct evidence for the activa- 
Bison self-oriented tendencies among high- 
in the SUME in test situations is provided 
effects nay reported by Ganzer (1968). The 
(TAS) ¢ audience presence and test anxiety 
subject: on the serial verbal learning of female 
made were investigated. Tabulations were 
the Jes ibn frequency and content of all of 
ey Jects task-irrelevant comments while 
E working on the task. * High TAS 
emitted especially in the Observed condition, 
analysi more than any other group. Content 
most] is revealed that the comments were 
Tin Y of a self-evaluative or apologetic 
te [Ganzer, 1968, p. 194].” 
t E "gra there is abundant evidence 
teocc € test-anxious person 1S more self- 
fi and self-deprecatory than his 
i ide. -anxious counterpart. It is also clear, 
Self-fo. of the research reviewed, that these 
Tess cused tendencies are activated by the 
ures of the testing situation. 


Struar 
TUATIONAL CONDITIONS AND THE TASK 


Prrrormance oF HicH- AND Low- 
TEsr-ANxIOUS SUBJECTS 


C" 
ried two initial studies published by the 
(Ma rs who originated test anxiety theory 
are & S. Sarason, 1952; S. Sarason, 
9f th ler, & Craighill, 1952) were prototypes 
Bons Many subsequent investigations of situ- 
TE stress variables and test anxiety. The 
i d predictions of test anxiety theory with 
d ae i 
to situational stress variables are: 
2 i : 
i cane talus situation contains elements which 
pet Y arouse test or achievement anxiety, this 
hus ance A anxiety drive will lead to poorer per- 
wiDonss, jn individuals who have task-irrelevant 
5 in their response repertory. For individuals 


q 0 : 
me. ts Such response tendencies, these stimulus 
5 Will raise their general drive level and result 


ay 


in improved performance. (Such elements would be 
any reference that the individual is being judged, a 
statement of expected performance, etc Sai s 

et al, 1932, p. 561]. diss 


Mandler and Sarason (1952) examined the 
effects of feedback and no-feedback on the 
performance of subjects scoring at extremes 
on the TAQ. As predicted, the two feedback 
conditions, both success and failure, facili- 
tated the performance of low-TAQ subjects 
and interfered with the performance of high: 
TAQ subjects. In the second study (Sarason 
et al., 1952) similar predictions were made 
concerning an instructional manipulation. In 
the stressful “ego-involving” instructional 
condition, subjects were told that they should 
easily complete the test within the time limit, 
when it was, in fact, made impossible for 
them to do so. This “expected-to-finish” con- 
dition can be described as an advance failure 
manipulation. The nonstressful instructions, 
“not-expected-to-finish,” informed the sub- 
ject that no one could expect to finish within 
the time limit. In general, the high-TAQ sub- 
jects did better following the “not-expected- 
to-finish” than the “expected-to-finish” in- 
structions, while the reverse was true for the 
low-anxious subjects. 

The many subsequent investigations of 
stressful situational conditions and test anx- 
iety have generally provi 


with the predictions of 
(Cox, 1966, 1968; Ganzer, 1968; Harleston, 


1962; Paul & Eriksen, 1964; Russell & I. 
Sarason, 1965; I. Sarason, 1958b, 1959, 1961; 
I. Sarason & Harmatz, 1965; I. Sarason & 
Minard, 1962; I. Sarason & Palola, 1960). 
Most of these studies have investigated the 
effects of feedback or instructional manipula- 
tions, though three were concerned with the 
effects of audience presence on task perform- 
ance (Cox, 1966, 1968; Ganzer, 1968). Con- 
sidering audience presence as a source of 
threatening evaluation, the u he bes 

i predictions of test anxiety 
apoie sep E debilitated the task 


theory. Audience presence i i 
performance of high-test-anxious subjects, 


and facilitated the performance of low-test- 
s subjects. | 

onal varia 
bove, have m 
arisons of (@) 


ded data consistent 
test anxiety theory 


anxiou: 

Instructi 
discussed à 
volved comp: 


tions, other than those 
ost frequently in- 
“ego-involving,” 
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“highly motivating” instructions, which inform 
the subject that his task performance will 
reflect his ability, (b) minimal task instruc- 
tions, and (c) instructions which assure the 
subject that his task performance will not be 
used to evaluate him. Frequently, the subject 
is told that his performance will be anonymous. 
He may be instructed that the experimenter 
is interested only in properties of the task, 
or that his performance is important only so 
that the experimenter may complete the study. 

The results of studies varying instructional 
conditions have generally reported an inter- 
action between level of test anxiety and evalu- 
ation emphasis: (a) Highly test-anxious sub- 
jects perform more poorly following highly 
evaluative “ego-involving” instructions than 
nonevaluative “anonymous” instructions while 
the reverse is true for low-test-anxious sub- 
jects. (5) Following highly evaluative instruc- 
tions, low-test-anxious subjects perform better 
than high-test-anxious subjects; following 
nonevaluative instructions, high-test-anxious 
subjects perform better than low. (c) Fol- 
lowing minimal task instructions, high- and 
low-test-anxious-subjects perform at about 
equivalent levels, intermediate between their 


performances in the highly evaluative and 
nonevaluative conditions 


All of the situational 
ferentially affect th 


conditions which dif- 


dimension affect 
low-test-anxious 
Persons in opposing ways. It might also be 
noted that situational conditions varying on 
this evaluative dimension have Opposing ef- 
fects on the direction of attention of high. 
and low-test-anxious persons. Highly evalua- 
tive conditions cause the highly test-anxious 
person to direct his attention internally rather 
than to the task. The same conditions cause 
the low-test-anxious person to increase his 
“attention to the task at hand [T Sarason, 
1960, p. 405].” Nonevaluative situational 
conditions do not elicit the highly test- 
anxious subject's self-directed interfering xe 
sponses, thus it is possible for him to direct 
his full attention to the task. Conversely, non- 


evaluative testing conditions presumably a 
not excite the nonanxious subject’s motives 
and interest, Though it cannot be said e is 
therefore turns his attention inward, " 
probable that his low interest in the tio] 
is accompanied by less than full atten 
to it. rf 
I. Sarason (1958b) has investigated € | 
instructional variation, which he labe scr 
“reassurance” condition, ‘These inst 
like the nonevaluative ones discussed E 1 
were designed to improve the performant 
high-test-anxious subjects rather than ma 
fere with it. In spite of the “geass te 
label, these instructions can most accurate 
be described as neutral with respect to 
evaluative dimension, since there was no 
erence in them to evaluation of the ca 
performance. Their nature might mos and | 
rately be described as information-givi0e - tol 
attention-directing. The subject's task o A 
learn a serial list of nonsense syllables? task 
lowing the standard serial learning ym 
instructions, the subject was warned this | 
many people become tense when leat nis | 
kind of task and that he should ee thal, 
Progress to be slow. He was further tok tha 
he should concentrate on the list rather hit 
Worrying about how he was doing. Whena 
set of instructions was compared to sr ae 
or minimal task instructions, high-test-an? 564 
subjects performed better following the ondi; 
surance instructions. The reassurance es Í 
tion was detrimental to the performan Sa | 
the low-test-anxious subjects, in compar gl 
to the standard task instructions. ens 
study suggests that it may not be ac asi 
to resort to removing the evaluative emp a 
from the highly anxious subject's perfor i4 
ance in order to improve it. The results i 
cate that it may be sufficient to subst ind 
task-relevant responses for the self-orie? ar 
ones by means of rather straightforw | 
attention-directing instructions. 


ref- l 


S wA 


ON | 
ANXIETY AND RANGE OF Cur UrIzATIO | 


n 
It was suggested i ib. 


B n the preceding sect! 
that evaluative testing conditions have OP 
posite effects on the attentional focuses NM 
high- and low-test-anxious Persons, W 
being evaluated, the high-test-anyious per? 
turns his attention inward while the low-te" 
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n po e focuses more fully on the task. 
Person Ph ag is that the high-test-anxious 
the low-t ends to fewer task cues than does 
interpret ae person. This attentional 
task m ion of the effects of test anxiety on 
pirical ormance Js consistent with an em- 
brook co advanced by Easter- 
į tween a 33) concerning the relationship be- 
lorem level and task variables. He 
cated th a large body of research which indi- 
rows emotional arousal consistently nar- 
Derforn ls range of cue utilization 1n task 
nance, 

E see following are examples of the research 
ion: Pe cited to support this generaliza- 
Menta Studies of the effects of experi- 
toa al ee a level on response 
Peripheral task, and simultaneous response to 

TOusa] al, only occasionally relevant stimuli. 
stimuli reduces responsivity to the peripheral 

i tal i, while maintaining or improving cen- 

i cer ge (Bahrick, Fitts, & Rankin, 
brook ont 1958: Davis, 1948; Easter- 
fmotiona] 2)" (b) Studies of the effects of 

€arnin Fie ge on measures of incidental 
of "ed Again arousal reduces the amount 

On or iie learning, while having no effect 
53. YN intentional learning (Aborn, 

Papanek ahrick, 1954; Bruner, Matter, & 

Silverm » 1955; Johnson, 1952; Kohn, 1954; 

hese an, 1954; Silverman & Blitz, 1956). 

Basterbrs, only a few of the studies cited by 
rook in support of his formulation. 

“asterbrook’s definition of emotional 


ar . 
th ®usal is very broad: 
wi 
fip nate response to a state o è 
Occurs A noxious stimulation, which underlies or 
its Meere cone with overt action and affects 
Breater gth and course. This emotional arousal is 
an u in neurotic than in normal subjects. greater 
e in subjects under stress or threat or in 
have ee and in general greater in animals that 
tions, cen “motivated” by any of the usual depriva- 
it is ^ Noxious stimulations, or other incentives than 
n unmotivated or resting animals 0 


Sbecies [p. 184] 
e b 
atone is one aspect of this emotional 
review, A mengei and several of the studies 
ing is by Easterbrook indicate that narrow- 
anxie e range of cue utilization accompanies 

lety arousal. 


I Since ‘Rasterbrook’s review) several studies 
dave reported the effects of experimentally 


R- 


f biological depriva- 


f the same 


aroused anxiety on range of cut utilization in 
task performance (Agnew & Agnew, 1963; 
Bruning, Capage, Kozuh, Young, & Young, 
1968; Tecce & Happ, 1964; Wachtel, 1966, 
1968). The results of these investigations 
have been consistent with Easterbrook’s 
formulation. Stressful situational conditions 
tended to reduce the range of cue utilization, 
in comparison to nonstressful conditions. No 
measures of individual differences were used 
in these studies. 

Zaffy and Bruning (1966), West, Lee, and 
Anderson (1969) and Wachtel (1966, 1968) 
reported relationships between scale-measured 
level and range of cue utilization. 
selected subjects from 
extreme scorers on the Taylor MAS. Rather 
than varying range of cues, they made the 
available cues either relevant or irrelevant to 
task performance. The subjects’ task was to 
learn a correct position for each of 19 presen- 
tations of 5 zeros, presented by a memory 
drum. In the relevant cue condition the zeros 
were numbered in order with subscripts from 
1 to 5; the same subscripts were used in the 
irrelevant cue condition but in random order. 
These conditions were compared to a no-cue 

he performance of the 


control condition. T 
low-MAS subjects was affected more by the 


presence of the cues, whether relevant or 
irrelevant, than was the performance of the 
high-MAS subjects. Within the low-MAS anx- 
iety level, there were differences significant 
beyond the .001 level in all three of the com- 
parisons of the three conditions, with superior 
performance in the relevant cue condition, 
intermediate performance in the no-cue condi- 
tion, and poorest performance in the irrelevant 
cue condition. The performance levels of the 
high MAS subjects followed the same order, 
but there was à significant. difference only 
between the relevant and irrelevant cue 
conditions (? < .025). 

More pertinent to the purposes of this re- 
view are the studies reported by West et al. 
(1969) and Wachtel (1966, 1968), since they 
selected subjects on the basis of their scores 
on measures of test anxiety. West et al. used 
sixth and eighth graders who were high and 
scorers On the Test Anxiety Scale for 
TASC; S. Sarason, Davidson 
aite, & Ruebush, 1960). The taste 


anxiety 
Zaffy and Bruning 


low 
Children 
Lighthall, W: 
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was made up of arithmetic problems. There 
Were two conditions determined by the nature 
of the information given in the problems. In 
one condition only information essential to 
solution of the problems was given; in the 
other condition the same relevant information 
was provided, but additional irrelevant infor- 
mation was given in each problem. The inter- 
action. between anxiety level and condition 
was significant. Both high- and low-TASC 
groups performed more poorly in the irrele- 
vant information condition. No tests of sig- 
nificance for individual pairs of means were 
reported; the means for the groups indicate 
that the performance level of the low-TASC 
group was affected more by 
irrelevant cues than was the performance level 
of the high-TASC group: low TASC, relevant 
information only, 
relevant, 


plus irrelevant, 
us children were 
on of irrelevant 


tionships between test anxiety level and range 
performance on a 
central continuous tracking task and reaction 
times to two occasional peripheral lights. A 
control condition with minimal task instruc- 
tions was compared with three experimental 
(a) Subjects told that unavoid- 
c shock would be administered 
during task performance; (5) an avoidable 
Shock condition in Which subjects were in- 
formed that they could avoid shock if their 
Combined performance on the central and 
Peripheral tasks remained high; (c) a per- 
sonality diagnostic condition in which the 
task was Presented as a means of evaluating 
the subjects personality; the subject was told 
that the periphera] lights would go on only 
if his performnce fell below an acceptable 
level. . 
Though Wachtel’s study was primarily 
focused on the effects of experimentally ma- 
nipulated stress on performance, he reported 
interesting relationships between subjects 
TAQ scores and task performance. THE 
"avoidable shock" and "personality diagnosis 
conditions were the Only ones in which test 
anxiety was appropriately engaged through 
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threatening the subject with evaluation of E 
performance. In both of these conditions ps 
was evidence of narrowing attention to Me. 
cues as a result of test anxiety. In the avol! j 
able shock condition there was a nearly P 
nificant r of —.35 (p< .10) between MU 
Scores (reciprocals of reaction times to a 
occasional peripheral lights) and E 
TAQ scores, and a significant positive e 
relation between TAQ scores and scores T 
the central tracking task (r = .58, p< d 
In the personality diagnosis condition R 
correlation between speed scores and T. 3 
Scores was —.44 (p< 05), though the vi 
relation between TAQ scores and ww 
Scores in this condition was only .02. W eu 
all subjects in these two conditions he 
combined, the correlation between TAQ a. ). 
and speed scores was —.40 (n = 30, p € “ety 
Thus in these two conditions test ene 
reduced responsiveness to the nd 
lights. In the other two conditions, in W f 
test anxiety was not appropriately arous ai 
there were no significant correlations betwee 
TAQ scores and task performance. lait 
A proposition offered by Wachtel to exp * 
the effects of experimentally manipulat 


: . P . m- 
anxiety and test anxiety in this study CO i 


. ation 
bines Easterbrook’s attentional formulatio" , 


and the interfering response hypothesis of 2 i 
test anxiety theorists: “when an individua a 
anxious, attention is diverted inward to de 
ception of his anxiety and therefore. ulis 
attention is available for external stimuli. 
[Wachtel, 1966, p. 2149].” ^ 


ENTS ^ 
Worry AND EMOTIONALITY: COMPONEN 
or TEST ANXIETY 


Liebert and Morris (1967) have presented | 
an analysis of test anxiety which is qu! | 
consistent with the one proposed in this pape": | 
They suggest that test anxiety is compose 
of two major aspects: worry and emotionality: 
The worry (W) component is described 45 
cognitive concern over Performance, a” 
emotionality (E) is the autonomic arousa 


a series of 
studies (Doctor & Altman, 1969; Liebert & 
Morris, 1967; Morris & Liebert, 1969. 1970; 

p" 


Spiegler, Morris, & Liebert, 1968) suggest 


tha; 
Dire E the two components vary pre- 
room kan h temporal relationships to class- 
E onc. ong and with performance 
Severa] LE The scale has been administered 
?tely before before an examination, immedi- 
E tend " and immediately after. Scores on 
While Ẹ ba be fairly constant across time; 
ore an ades reach a peak immediately be- 
(Mediate xamination, falling off rapidly im- 
iguficantly o the examination. W scores are 
Subjects? y and negatively correlated. with 
Ce ‘ex preexamination ratings of perform- 
felateq (o nee? while E scores are not 
hree performance expectancy. l 
T"esults o studies in this series have yielded 
motional the relationship between worry and 
N Altm: ity and task performance (Doctor 
Lu DEN Mens & Liebert, 1969, 
Boon, Morris and Liebert (1969) devised 
a Myiety Paper-and-pencil measure of the two 
items pag iie this one composed of 
ance | rom the Taylor MAS. The perform- 
dult 1 was a total of five Wechsler 
2% 9 ntelligence Scale (WAITS) subtests. A 
Subject torial design was used: half of the 
each e completed only the easy items in 
Eon Ibtest, half the difficult items; half of 
Subjects were aware of being timed, half 


Ver 3 E 
Es not. There were significant interactions 
Veen worry scores and both task difficulty 
total WAIS 


al a 
co, Uming in their effects on : 
Volvin, There were no significant results in- 
| The "E emotionality and 
E. ors suggest that 
antel] Y, which affects 
acts ectual-cognitive tasks | 
itu With the relevant variables 0 
ation [pp. 243-244]." 
age and Altman (19 
lonships between W 


“it is worry, 
performance 


and which inter- 
f the test 


69) examined the 


8 
n 
» and E scores on 


| 
| 


'ebert and Morris’ (1967) scale and final 
wmination grades in à psychology course. 
antly and 


d to be signific 
.CRatively related to exam performance. 
“ores were negatively related to performance 
wy among subjects below the group kx 
W score. When W was high, the a air 
Of E had little or no effect on perform ila 
Teu dee ‘ 70) volving college 
: «sh school stu- 


higl 
computed be- 


p, Sores were foun 


Undergraduates, 3 re 
| lents. Partial correl 
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tween worry and emotionality scores on the 
Liebert-Morris (1967) scale, which was ad- 
ministered immediately before a course final 
examination and examination grades. In both 
studies, partial correlations between worry 
scores and final examination grades were nega- 


tive and significant. Partial correlations be- 


tween emotionality scores and grades were 


nonsignificant. 

The worry component seems closely related 
to an attentional interpretation of test anx- 
iety, which proposes that the adverse effects of 
test anxiety are due to attention being divided 
between self and the task. Worry has been 
described as “preoccupation with performance 
[Doctor & Altman, 1969, p. 564],” “cognitive 
concern about the consequences of failing, the 
ability of others relative to one’s own, etc. 
[Liebert & Morris, 1967, p. 975]," “cognitive 
or intellectual concern about one’s own per- 
formance [Spiegler et al., 1968, p. 451].” 
These descriptions all refer to attentionally 
demanding cognitive activity going on during 
task performance. This is not to say that auto- 
nomic activity is not demanding of attention 
as well, but it is less likely to require atten- 
tion except “at high levels where physiological 
reactivity might be distractive and annoying 
[Doctor & Altman, 1969, p. 364]." 


CONCLUSIONS AND IMPLICATIONS 


The literature reviewed supports an atten- 
tional interpretation of the debilitating ef- 
fects of test anxiety on task performance. The 
highly test-anxious person responds to evalu- 
ative testing conditions with ruminative, self- 
evaluative worry and, thus, cannot direct ade- 


quate attention to task-relevant variables. An 
attentional analysis is closely related to exist- 
ing theories of test anxiety. Mandler and 
Sarason’s (1952) original theoretical position 
assumed that anxiety interferes with the per- 
formance only of subjects who have a 
habitual class of self-oriented interfering re- 
sponses in their response repertoires. For sub- 
jects without this class of interfering re- 
sponses, anxiety may facilitate task-relevant 
responses. Alpert and Haber's Achievement 
Anxiety Test was designed to provide inde- 
pendent measures of these two classes of re- 


sponse to test anxiety. Liebert and Morris? 
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(1967) analysis of test anxiety into the two 
components of worry and emotionality is a 
logical extension of test anxiety theory. Emo- 
tional arousal appears to bear no consistent 
relationship to performance on intellectual or 
cognitive tasks, while worry is consistently 
and negatively related to performance. An 
attentional interpretation states simply that 
the reason “worry” debilitates task perform- 
ance is that it is attentionally demanding and 
distracts attention from the task. 

Before moving into specific research and 
treatment applications, a few implications of 
the attentional analysis of test anxiety are 
stated below: 

l. An attentional approach is explicitly 
concerned with how the subject uses his task 
time—his cognitive activity, what he is 
thinking about and attending to. 

2. This approach implies little interest in 
autonomic arousal per se. In this context, 
degree of arousal is irrelevant unless the sub- 
ject is attending to his arousal. Of course, as 
noted earlier, when arousal becomes quite 
extreme, it is attentionally demanding. 

3. Finally, this analysis implies that the 
test-anxious person's performance may be 
improved by directing his attention to task- 
relevant variables, and away from self-evalua- 
tive rumination. 


Experimental Applications 


Though test anxiety theorists have not 
viewed test anxiety as a unitary dimension 
of emotional arousal, the experimental studies 
in this area have typically been concerned 
with manipulations of the evaluative situa- 
tional conditions which evoke test anxiety. 
With two notable exceptions (Sarason et al., 
1968; Sieber, 1969) studies have not investi- 
gated conditions designed to alter the inter- 
fering responses elicited by the anxiety. Re- 
search manipulating evaluative conditions has 
been necessary in order to establish test anx- 
iety as a theoretical construct and to validate 
measures of test anxiety. These goals have 
been realized; it seems reasonable to suggest 
that an appropriate next stage for research in 
this area is one in which the evaluative dimen- 
sion is held constant, and in which a search 
is begun for variables that will alter the 
test-anxious subjects’ interfering responses. 
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An attentional analysis of test anxiety C! | 
provide some direction to such a search. 

There is a large body of literature in selec- 
tive attention, which has recently been No 
viewed by Egeth (1967). The basic premise 
of the studies in this research area is that 
subjects’ attention can be differentially di- 
rected to specific stimuli or stimulus attributes 
and away from others. The nature of the 
mechanisms involved, the completeness of the 
selectivity, and the level at which the selec- 
tion takes place are matters of some dispute 
(e.g., Broadbent, 1958; Deutsch & Deutsch, 
1963; Treisman, 1964). It is, however, emy 
pirically established that persons can be M- 
structed to be selectively more attentive t0 
specific stimulus attributes or dimensions an 
to be less attentive to others. Moreover, with 
repeated training under attention-directing 1" 
structions, subjects become more and more 
skilled in attending to the “relevant” stimuli 
and ignoring the “irrelevant” stimuli. 

The selective attention literature has not 
been concerned with individual differences: 
There is no evidence, therefore, on how test 
anxious persons might respond to attention- 
directing instructions. There is a good dea! / 
of evidence, however, which indicates that 
highly test-anxious persons are very respon 
sive to social cues of all sorts, including verbal 
reinforcement (I. Sarason & Ganzer, 1962; 
1963) modeling cues (I. Sarason et 4! 
1968), persuasion (Janis, 1955), and con 
formity pressures (Meunier & Rule, 1967): 
as well as evaluative instructional manipula 
tions. It seems a plausible hypothesis that 
test-anxious subjects should be susceptible t° 
attentional instructions directing them 
selectively attend to task-relevant variables 
and to ignore self-relevant variables. t 

Instructional manipulations are the mo$ 
obvious means of directing subject's attention 
to task-relevant variables, but they are ne 
the only available means. For examples 
modeling has been demonstrated as à very | 
effective means of transmitting new behavio! 
(e.g, Bandura, 1969: Bandura & wale 
1963). I. Sarason et al (1968) and I. Saraso" 
(1968) have reported studies suggesting ma 
the verbal learning of highly test-anxious E 
jects is facilitated by prior exposure to pev 
nesslike live models learning similar seri 


1 


lists. The use of models who transmit task- 
attending, nonworrying cues may be an ef- 
fective technique for other kinds of learning 
Situations. 

Task cues themselves may provide another 
means of manipulating the attention of test- 
B us subjects. Sieber (1969) has reported 
nim in which high- and low-test-anxious 
RA worked on a multistage task. In one 
pla A ion, they were provided with visual dis- 
fà E of all of the earlier stages of the task; 
The he condition, there were no displays. 

ighly test-anxious children performed 
Eu in the visual display than the no- 
ifere condition, while the displays had no 
esi ntial effect on the performance of the 
me eng children. Seiber described 
Br as as “memory supports. They 
ing also be described as means of sustain- 
oa to task-relevant variables. It 
sup be possible to develop such. memory- 
ni porting or "attention-sustaining. tech- 

mM for use with other tasks and age levels. 
E point, the research possibilities dis- 
Of m AVE been concerned only with means 
Kr Pilating the attention of highly test- 
exhaust persons. These suggestions do not 
an s the research possibilities implied by 
Sugge entional approach. The approach also 
Ea analysis of other areas, for instance, 
Manne designed to examine 1n detail the 

i. in which high- and low-test-ansou® 
Derg, ns deploy their attention during tas 

Ormance, Such research would inevitably 
s ^d concern with the properties of specific 
on and the attentional demands they make 

Subjects, 


— ee 


Tw, 
ment Applications 

E Purpose of this section is not to review 

a d detail the treatment literature 1n test 

Me Cty, but rather to suggest possible treat- 

ü 

tr 


ac cChniques which might be derived from 
entional approach. The test anxiety 
‘Ve ment literature is described briefly, how- 
an p, D order to provide some contrast with 
€ntional treatment approach. 

e the published accounts of treatme 
tig, St-anxious subjects have involved varia- 
(e hw Systematic desensitization techniques 
n, 1969; Donner & Guerney, 1969; 
& Krumboltz, 1967; Garlington & 


y 
| 


tment 
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Cotler, 1968; Thli & Garlington, 1969; John- 
son & Sechrest, 1968; Katahn, Strenger, & 
Cherry, 1966; Konda§, 1967; Laxer, Quarter, 
Kooman, & Walker, 1969; Snider & Oetting, 
1966: Suinn, 1968). This literature has con- 
tributed a graet deal to analysis of the rela- 
tive efficacy of various components of the 
systematic desensitization procedures, for in- 
stance, standardized versus individualized 
anxiety hierarchies, partial versus full hier- 
archies, relaxation only versus full desensi- 
tization, etc. The research indicates in gen- 
eral that systematic desensitization has bene- 
ficial effects on self-reported measures of test 
anxiety, and often, on scholastic performance. 
Therefore, it is not with any intent to 
minimize the value of this treatment literature 
that it is noted that these studies have evolved 
from interest in specific treatment techniques, 
rather than from an analysis of the nature 
and effects of test anxiety. Test anxiety has 
been considered to be a severe problem, and 
thus its treatment is considered a stringent 
test of systematic desensitization procedures. 
Tt has implicitly been assumed that test anx- 
iety differs only in degree from the specific 
anxieties and phobias (e.g. rat, spider, snake) 
dealt with in much of the behavior modifica- 
tion literature. It should also be noted that 
by training test-anxious subjects to relax in 
the presence of progressively more stressful 
stimuli, a systematic desensitization treatment 
approach assumes that the emotional arousal 


component of test anxiety is its defining 


characteristic. . ; 
The attentional formulation suggests quite 


a different approach to treatment of test- 
anxious persons. First of all, the specific con- 
cern with how highly test-ansious subjects 
use their task time suggests treatment in 
which subjects are given intensive practice In 
dealing with tests. Second, this task practice 
should be accompanied by instructions to 
focus fully on the tasks and to inhibit self- 
relevant thinking. In à pilot study, the author 
(Wine, 1970) has compared the relative effi- 
cacy of attentional training of the kind de- 
scribed, attentional training plus relažation 
training, and à tgelf-explorational" therapy 
approach with test-anxious undergraduate stu- 
dents. Each treatment condition was gon 
ducted over six hourly sessions. Test-anxious 
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subjects in the attentional training condition 
changed positively and significantly on self- 
report measures of test-related anxiety, as 
well as on two standardized performance mea- 
sures. The performance measures were Forms 
1 and 2 of the Wonderlic Personnel Tests 
(Wonderlic, 1959) and matched forms of a 
digit-symbol test? The attentional training 
plus relaxation also produced positive changes, 
but added nothing beyond attentional train- 
ing alone. Subjects in the self-explorational 
condition did not change significantly on any 
measures. 


A larger, more adequate treatment study is 
being planned for the near future. Though the 
results of this pilot study are no more than 
suggestive, one of the suggestions is that at- 
tentional training not only improves task per- 
formance but lowers reported test anxiety level 
as well. In conclusion, perhaps it is possible to 
define test anxiety attentionally. If a person 
is not attending to his test anxiety, it, in 
effect, does not exist. *My experience is what 


l agree to attend to [William James, 1890, 
p. 403].” 


? The digit-symbol tests were selected from a bat- 
tery of eight, constructed and standardized by 
Marvin Brown, University of Waterloo, 1968. 
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cerning research decisions. 


I 
! oe you have run an experiment on 20 
sult een have obtained a significant re- 
« 05 1 : confirms your theory (z = 2.23, p 
run al i ee i You now have cause to 
b ins CUM group of 10 subjects. What 
sults "^ —- the probability is that the re- 
separ e significant, by a one-tailed test, 
i ately for this group?" 
Y Fin feel that the probability is some- 
noy is ound .85, you may be pleased to 
ndeed i you belong to a majority group. 
smal] Es hat was the median answer of two 
| bond pere who were kind enough to re- 
ings or Eis questionnaire distributed at meet- 
nd of Te Mathematical Psychology Group 
tion, he American Psychological Associa- 
hs, e other hand, if you feel that the 
inority y is around .48, you belong to a 
Users 1 Only 9 of our 84 respondents gave 
Dpens between .40 and .60. However, 48 
at to be a much more reasonable estl- 
; Me than 85: 


( 


git 
he ordori d 
thank ordering of authors is random. We wish to 


l 
Mente the many friends and colleagues who com- 
Me inq ER an earlier version, and in particular we 
gohen ebted to Maya Bar-Hillel, Jack Block, Jacob 
mug Louis L, Guttman, John W. Tukey, Ester 
T "quests Gideon Shwarz. 
io sky à ior reprints should 
DOES enter for Advanced Stu 
Orq, œ ences, 202 Junipero Serra Boulev: 
r opp omia 94308. 
a way, ied estimate can be interpreted in sev- 
aye ip: Mie possible approach is to follow. com- 
thee is pus practice, where a value obtained in one 
qu, aken to define a plausible alternative to 
DU Ej Dothesis. The probability requested in the 
a Ae then be interpreted as the power of the 
cant p (ie, the probability of obtaining a sig- 
Native we in the second sample) against the 
sam © hypothesis defined by the result of the 
Ple. In the special case of a test of a mean 


be sent to Amos 
dy in the Behav- 
ard, Stan- 


THE LAW OF SMALL NUMBERS 
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People have erroneous intuitions about the laws of chance. In particul hey 
regard a sample randomly drawn írom a population as vehe dr > 
that is, similar to the population in all essential characteristics. The preval x 
of the belief and its unfortunate consequences for psychological leen xum 
illustrated by the responses of professional psychologists toa questionnaire am 


Apparently, most psychologists have an ex- 
aggerated belief in the likelihood of success- 
fully replicating an obtained finding. The 
sources of such beliefs, and their consequences 
for the conduct of scientific inquiry, are what 
this paper is about. Our thesis is that people 
have strong intuitions about random sam- 
pling; that these intuitions are wrong in fun- 
damental respects; that these intuitions are 
shared by naive subjects and by trained sci- 
entists; and that they are applied with un- 
fortunate consequences in the course of sci- 
entific inquiry. 

We submit that people view a sample ran- 
domly drawn from a population as highly 
representative, that is, similar to the popula- 
tion in all essential characteristics. Conse- 
quently, they expect any two samples drawn 
from a particular population to be more simi- 
lar to one another and to the population than 
sampling theory predicts, at least for small 
samples. 

The tendenc 
resentation is m 


situations. When su 
a random sequen 


y to regard a sample as a rep- 
anifest in a wide variety of 
bjects are instructed to 
ce of hypothetical 


generate 

tosses of a fair coin, for example, they pro- 

duce sequences where the proportion of 
ould compute the power 


ance, one W' 
t the hypothesis that the population 
mean equals the mean of the first sample. Since the 
size of the second sample is half that of the first, the 
computed probability of obtaining 221.645 is only 
473. A theoretically more justifiable approach is to 
jnterpret the requested probability within a Bayesian 
framework and compute it relative to some appropri- 
ately selected prior distribution. Assuming a uni- 
form prior, the desired posterior probability is 418, 
Clearly, if the prior distribution favors the null hy- 

hesi often the case, the posterior proba- 


pothesis, as is 
bility will be even smaller. 


with known vari 
oi the test agains 
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heads in any short segment stays far closer 
to .50 than the laws of chance would predict 
(Tune, 1964). Thus, each segment of the re- 
sponse sequence is highly representative of 
the "fairness" of the coin. Similar effects are 
observed when subjects successively predict 
events in a randomly generated Series, as in 
probability learning experiments (Estes, 1964) 
or in other sequential games of chance. Sub- 
jects act as if every segment of the random 
sequence must reflect the true proportion: if 
the sequence has strayed from the population 
proportion, a corrective bias in the other direc- 
tion is expected. This has been called the 
gambler’s fallacy. 

The heart of the gambler’s fallacy is a mis- 
conception of the fairness of the laws of 
chance. The gambler feels that the fairness of 
the coin entitles him to expect that any devi- 
ation in one direction will soon be cancelled 
by a corresponding deviation in the other. 
Even the fairest of coins, however, given the 
limitations of its memory and moral sense, 
cannot be as fair as the gambler expects it to 


be. This fallacy is not unique to gamblers. 
Consider the following example: 


The mean IQ of the population of eighth graders 
in a city is known to be 100. You have selected a 
random sample of 50 children for a study of educa- 
tional achievements. The first child tested has an 


IQ of 150. What do you expect the mean IQ to be 
for the whole sample? 


The correct answer is 101. A surprisingly large 
number of people believe that the expected IQ 
for the sample is still 100. This expectation 
can be justified only by the belief that a 
random process is self-correcting. Idioms such 
as “errors cancel each other out” reflect the 
image of an active self-correcting process, 
Some familiar processes in nature obey such 
laws: a deviation from a stable equilibrium 
Produces a force that restores the equilibrium. 
The laws of chance, in contrast, do not work 
that Way: deviations are not canceled as 
sampling Proceeds, they are merely diluted. 
Thus far, we have attempted to describe 
two related intuitions about chance. We pro- 
posed a representation hypothesis according to 
which people believe samples to be very simi- 
lar to one another and to the population from 
which they are drawn, We also suggested that 
people believe sampling to be a self-correcting 
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process. The two beliefs lead to the same E 
sequences. Both generate expectations ans A 
characteristics of samples, and the variabili 
of these expectations is less than the tru 
variability, at least for small samples. ; 
The law of large numbers guarantees the 
very large samples will indeed be highly rei 
resentative of the population from which b 
are drawn. If, in addition, a self-correctl 
tendency is at work, then small samples you 
also be highly representative and dung E 
one another. People’s intuitions about ran 
sampling appear to satisfy the law of B 
numbers, which asserts that the law of is bi / 
numbers applies to small numbers as wel lives 
Consider a hypothetical scientist who HO 
by the law of small numbers. How would a | 
belief affect his scientific work? Assume : a 
scientist studies phenomena whose magnis, à 
is small relative to uncontrolled variabil A 
that is, the signal-to-noise ratio in the ™ d 
sages he receives from nature is low. Our $ 23 
entist could be a meteorologist, a pharm 
cologist, or perhaps a psychologist. E 
If he believes in the law of small number 
the scientist will have exaggerated confident: | 
in the validity of conclusions based on s i 
samples. To illustrate, suppose he is enges i 
in studying which of two toys infants S 
prefer to play with. Of the first five it 
studied, four have shown a preference fot me 
same toy. Many a psychologist will feel nee 
confidence at this point, that the null hyper | 
sis of no preference is false. Fortunately, “or 
a conviction is not a sufficient condition : 
journal publication, although it may do fo w] 
book. By a quick computation, our psycho | 
gist will discover that the probability © 
result as extreme as the one obtained 5 
high as 3/8 under the null hypothesis. | n 
To be sure, the application of statis. 
hypothesis testing to scientific inference ie 
beset with serious difficulties, Nevertheless cr, | 
Computation of significance levels (or li fef. 
hood ratios, as a Bayesian might e 
forces the scientist to evaluate the obte n] 
effect in terms of a valid estimate of samP iv 
variance rather than in terms of his supe ort 
biased estimate. Statistical tests, there er] 
protect the scientific community against M 
hasty rejections of the null hypothesis 57,5 
Type I error) by policing its many men 


Og DS à 
fes,  YChologists typically expec 
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ftir rather live by the law of small 
comparabl p tie other hand, there are no 
failing : e safeguards. against the risk of 
(ie Aaa confirm a valid research hypothesis 
Im HE II error). 
Relation b : psychologist who studies the cor- 
NN oo ge ea and 
rm hen deciding on sample size, he may 
expect? as follows: "What correlation do I 
the HE ei 35. What JV do I need to make 
B3. Fine — (Looks at table.) N = 
E that's my sample." The only flaw in 
qM px is that our psychologist has for- 
Cause he pi esi variation, possibly be- 
ighly re elieves that any sample must be 
ever, if en of its population. How- 
Populati ns guess about the correlation in the 
Sample I is correct, the correlation in the 
35. E^ about as likely to lie below or above 
hifican ne the likelihood of obtaining a sig- 
Nus : result (i.e., the power of the test) for 
In 33 is about .50. 
Power se investigation of statistical 
Dlausible ps (1962, 1969) has provided 
Small eff efinitions of large, medium, and 
ional as and an extensive set of computa- 
Variety PA to the estimation of power for a 
Or a diff statistical tests. In the normal test 
ample erence between two means, for ex- 
Nee J^ difference of .250 is small, a differ- 
v js mag is medium, and a difference of 
tions, me according to the proposed defini- 
and ser ie mean 1Q difference between clerical 
c niskilled workers is a medium effect. 
ohen ingenious study of research practice, 
analyse (1962) reviewed all the statistical 
na of cheap in one volume of the Jour- 
tompy Abnormal and Social Psychology, and 
th ited the likelihood of detecting eac of 
üs ae sizes of effect. The average power 
for for the detection of small effects, -48 


} Medium effects, and .83 for large effects. 
t medium ef- 


in the above 


8 
ty, 8nd select sample size as 
tudies should 


Aa 
inq €, the power of their 5 
ch be about .50. 
iow ep analysis shows that the statistical 
mn ly s many psychological studies is ridicu- 
al ay This is a self-defeating practice: 
tes for frustrated scientists and ineffi- 
earch, The investigator who tests à 


Ypothesis but fails to obtain significant 
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results cannot help but regard nature as un 
trustworthy or even hostile. [Oma z 
Overall (1969) has shown, the prevalence of 
studies deficient in statistical power is not onlv 
wasteful but actually pernicious: it results in 
a large proportion of invalid rejections of the 
null hypothesis among published results. 
Because considerations of statistical power 
are of particular importance in the design of 
replication studies, we probed attitudes con 
cerning replication in our questionnaire. 


Suppose one of your doctoral students has com- 
pleted a difficult and time-consuming experiment on 
40 animals. He has scored and analyzed a large num- 
ber of variables. His results are generally inconclusive, 
but one before-after comparison yields a highly EE 
nificant 4 — 2.70, which is surprising and could be of 
major theoretical significance. 

Considering the importance of the result, its sur- 
prisal value, and the number of analyses that your 


student has performed— 


commend that he replicate the study 
If you recommend replication, 
ld you urge him to run? 


Would you re 

before publishing? 

how many animals wou 

Among the psychologists to whom we put 
these questions there was overwhelming senti- 
ment favoring replication: it was recom- 
mended by 66 out of 75 respondents, probably 
because they suspected that the single sig- 
nificant result was due to chance. The median 
recommendation was for the doctoral student 
to run 20 subjects in a replication study. It is 
instructive to consider the likely consequences 
of this advice. If the mean and the variance 
in the second sample are actually identical to 
those in the first sample, then the resulting 
value of t will be 1.88. Following the reason- 
ing of Footnote 2, the student’s chance of ob- 
taining a significant result in the replication 
is only slightly above one-half (for p= .05, 
one-tail test). Since we had anticipated that 
a replication sample of 20 would appear rea- 
sonable to our respondents, We added the fol- 


lowing question: 


our 


Assume that y unhappy student has in fact 
E ] stu. 


repeated the initial dy with 20 additional animals, 
and has obtained an insignificant result in the same 
rod —124. What would you recommend 
dire eet [the numbers in parentheses refer 
ts die number o ndents who checked each an- 
swer] 


(a) He 3 3 
conclusion a5 


k one: 
f respo! 


should pool the results and publish his 
fact. (0) 
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(b) He should report the results as a tentative 
finding. (26) 


(c) He should run another group of [median = 
20] animals. (21) 


(d) He should try to find an explanation for the 
difference between the two groups. (30) 
Note that regardless of one's confidence in 
the original finding, its credibility is surely 
enhanced by the replication. Not only is the 
experimental effect in the same dir 
the two samples but the magnitud 
effect in the replication is fully two- 
that in the original stud 


ection in 
e of the 
thirds of 
y. In view of the sam- 
ple size (20), which our respondents recom- 
mended, the replication was about as success- 
ful as one is entitled to expect, The distribu- 
tion of Tesponses, however, reflects continued 
skepticism Concerning the student's finding 
following the recommended replication, This 
unhappy state of affairs is a typical conse- 
quence of insufficient statistical power. 

In contrast to Responses b and c, which 
Can be justified on some grounds, the most 
bopular response, Response d, is indefensible. 
We doubt that the Same answer would have 
been obtained if the respondents had realized 
that the difference between the two studies 
does not even approach significance. (If the 
variances of the two samples are equal, ¢ for 
the difference is 3.) In the absence of a 
Statistical test, our respondents followed the 
Tepresentation hypothesis: as the difference 
between the two samples was larger than they 
expected, they viewed it as worthy of expla- 
nation. However, the attempt to “find an ex- 
planation for the difference between the two 
Broups" is in all probability an exercise in ex- 
plaining noise. 

Altogether our respondents evaluated the 
replication rather harshly. This follows from 
the representation hypothesis: if we expect all 
samples to be very similar to one another, 
then almost all replications of a valid hy- 
pothesis should be statistically significant. The 
harshness oí the criterion for successful repli- 
cation is manifest in the responses to the fol- 
lowing question: 


An investigator has reported a result shar Ry 
consider implausible. He ran 15 subjects, and zepa ea 
a significant value, £ = 2.46. Another as in 
attempted to duplicate his procedure, o Au 
tained a nonsignificant value of £ with the sa 
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E mo €— — ame in 
number of subjects. The direction was the sa 
both sets of data. 


" is the 
You are reviewing the literature. W me E you 
highest value of £ in the second set of data tha 
would describe as a failure to replicate ? 


t 
The majority of our respondents regarded 
= 1.70 as a failure to replicate. If the be 
two such studies (¢ = 2.46 and t = s. 
pooled, the value of 7 for the combined " 
is about 3.00 (assuming equal epee. 
Thus, we are faced with a paradoxical a i 
affairs, in which the same data that wou hen 
crease our confidence in the finding iit 
viewed as part of the original study, $ ent] 
our confidence when viewed as an pem y | 
study. This double standard is pur oe 
disturbing since, for many reasons, Ye Cd l 
tions are usually considered as indepen | 
studies, and hypotheses are often eva M. 
by listing confirming and disconfirming 
Orts, n 
á Contrary to a widespread belief, a case i 
be made that a replication sample we: 
often be larger than the original. The ften | 
sion to replicate a once obtained finding ^ 
expresses a great fondness for that finding 
a desire to see it accepted by a skeptical hy 
munity. Since that community pog peli 
demands that the replication be es | 
dently significant, or at least that it su 0 
significance, one must run a large samp del 
illustrate, if the unfortunate doctoral a. 
whose thesis was discussed earlier ee 
the validity of his initial result (t= 2.7 k 0 
= 40), and if he is willing to accept a A 
only .10 of obtaining a ¢ lower than 1.7 * 
should run approximately 50 animals in v" 
replication study. With à somewhat WES {| 
initial result Ge 2.20, N = 40), the size M 
the replication sample required for the 54 
power rises to about 75. jo! 
That the effects discussed thus far are ee 
limited to hypotheses about means and vine 
ances is demonstrated by the re 
following question: 


pis | 


sponses to 


0 
You have run a correlational study, scoring 2d 
variables on 100 subjects, Twenty-seven of the 
correlation coefficients are Significant at the .05 le 
and 9 of these are signi 
The mean absolute ] 
tions is .31, and the 
sonable on theoretica] 


grounds. How many of the 


niia sat correlations would you expect to beise: 
with N EUN in an exact replication of the study, 
Er = 40, a correlation of about 31 is 
is the of or significance at the .05 level. This 
the esie of the significant correlations 1n 
H A s study. Thus, only about half of 
or 14) 4 nally significant correlations (ie. 13 
ü Sap remain significant with N = 40. 
Teplication | of course, the correlations in the 
e E are bound to differ from those in 
Eos = study. Hence, by regression ef 
Most like) initially significant coefficients are 
Peated E: y to be reduced. Thus, 8 to 10 re- 
nal 27 Significant correlations from the origi- 
One is Is probably a generous estimate of what 
entitled to expect. The median estimate 
"pn respondents is 18. This is more than 
ions eua of repeated significant correla- 
recom hat will be found if the correlations im 
from peta 40 subjects randomly arg 
Dect ioa ai 100! "rs pre oo : ls 
inal statisti an a mere duplica jon e ma 
expect stics in the replication samp e; they 
Sults b duplication of the significance que 
Shine little regard for sample size. is 
e re anion requires a ludicrous extension S: 
Smal] Presentation hypothesis; even the law gi 
a reg numbers is incapable of generating Suc 
ult, 
Ts Som epi that patterns of a 
the na almost in their entirety aeg 
Dloreq lonale for a common, though much 
Misa core The investigator who hye 
hxlety correlations between three inden fi 
Y and three indexes of dependency Wi 
Bi. report and interpret with great dr 
taineq Du single significant correlation 2 E 
Stems ; His confidence in the shaky finding 
S from his belief that the obtained corre- 
E matrix is highly representative and 
ily replicable. 
i th review, we have seen that t i 
€ law of small numbers practices science 


he believer 


S follows: 
* 
M gambles his research hypotheses On 
Samples without realizing that the odds 


apa; 
Sainst him are unreasonably high. He over- 
‘mates power. 


8 " 
Examples can be supplied on demand. 
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e He has undue confidence in early trends 
(e.g., the data of the first few subjects) and 
in the stability of observed patterns (e.g., the 
number and identity of significant results). 
He overestimates significance. 

e In evaluating replications, his or others’, 
he has unreasonably high expectations about 
the replicability of significant results. He un- 
derestimates the breadth of confidence inter- 
vals. 
o He rarely attributes a deviation of results 
from expectations to sampling variability, be- 
cause he finds a causal “explanation” for any 
discrepancy. Thus, he has little opportunity 
to recognize sampling variation in action. His 
belief in the law of small numbers, therefore, 
will forever remain intact. 


Our questionnaire elicited considerable evi- 
dence for the prevalence of the belief in the 
law of small numbers.* Our typical respondent 
is a believer, regardless of the group to which 
he belongs. There were practically no differ- 
ences between the median responses of audi- 
ences at a mathematical psychology meeting 
and at a general session of the American Psy- 
chological Association convention, although we 
make no claims for the representativeness of 
either sample. Apparently, acquaintance with 
formal logic and with probability theory does 
intuitions. What, 


not extinguish erroneous 
then, can be done? Can the belief in the law of 


small numbers be abolished or at least con- 


trolled? 

Research experience 
because sampling variation 
ned.” Corrective experiences are 


those that provide neither motive nor oppor- 
tunity for spurious explanation. Thus, a stu- 
dent in a statistics course may draw repeated 
samples of given size from a population, and 


learn the effect of sample size on sampling 


968) has argued that people fail to 

i M tion or certainty from prob- 
«tra ufficient jnforma $ : p 
RS data; he called. this failure conservatism. 
Our respondents can hardly be described as conserva- 
is Rather, in accord with the representation hy- 
Ces they tend to extract more certainty from the 
po! ^a the data, in fact, contain. The conditions 
whic people may appear conservative are 
pee jn Kahnemann, D., and Tversky, A. Sub- 
d ic» probability : A judgment of representativeness. 
tied Rep.) Oregon Research Institute, 1971, 2 (2). 


is unlikely to help 
much, n is all too 
easily ^explair 


5 Edwards (1 
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variability from personal observation. We are 
far from certain, however, that expectations 
can be corrected in this manner, since related 
biases, such as the gambler's fallacy, survive 
considerable contradictory evidence. 

Even if the bias cannot be unlearned, stu- 
dents can learn to recognize its existence and 
take the necessary precautions. Since the 
teaching of statistics is not short on admoni- 
tions, a warning about biased statistical intui- 
tions may not be out of place. The obvious 
precaution is computation. The believer in 
the law of small numbers has incorrect intui- 
tions about significance level, power, and confi- 
dence intervals. Significance levels are usually 
computed and reported, but power and con- 
fidence limits are not. Perhaps they should be. 

Explicit computation of power, relative to 
some reasonable hypothesis, for instance, Co- 
hen's (1962, 1969) small, large, and medium 
effects, should surely be carried out before 
any study is done. Such computations will 
often lead to the realization that there is 
simply no point in running the study unless, 
for example, sample size is multiplied by four. 
We refuse to believe that a serious investi- 
gator will knowingly accept a .50 risk of fail- 
ing to confirm a valid research hypothesis. In 
addition, computations of power are essential 
to the interpretation of negative results, that 
is, failures to reject the null hypothesis. Be- 
cause readers’ intuitive estimates of power are 
likely to be wrong, the publication of com- 
puted values does not appear to be a waste of 
either readers! time or journal space. 

In the early psychological literature, the 
convention prevailed of reporting, for example, 
a sample mean as X + PE, where PE is the 
probable error (i.e., the 50% confidence in- 
terval around the mean). This convention was 
later abandoned in favor of the hypothesis- 
testing formulation. A confidence interval, 
however, provides a useful index of sampling 
variability, and it is precisely this variability 
that we tend to underestimate. The emphasis 
on significance levels tends to obscure a funda- 
mental distinction between the size of an effect 
and its statistical significance. Regardless of 
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sample size, the size of an effect in one study 
is a reasonable estimate of the size of the effect 
in replication. In contrast, the estimated SIs” 
nificance level in a replication depends criti- 
cally on sample size. Unrealistic expectations 
concerning the replicability of significance lev- 
els may be corrected if the distinction between 
size and significance is clarified, and if the 
computed size of observed effects is routinely 
reported. From this point of view, at be, 
the acceptance of the hypothesis-testing model 
has not been an unmixed blessing for psychol- 
ogy. 

The true believer in the law of small nun 
bers commits his multitude of sins against the 
logic of statistical inference in good faith. The 
representation hypothesis describes a cog) 
tive or perceptual bias, which operates regat? 
less of motivational factors. Thus, while a 
hasty rejection of the null hypothesis is gani 
fying, the rejection of a cherished hypothes! 
is aggravating, yet the true believer is subjec’ 
to both. His intuitive expectations are are 
erned by a consistent misperception of E. 
world rather than by opportunistic wishit 
thinking. Given some editorial prodding; ~, 
may be willing to regard his statistical int 
tions with proper suspicion and replace " 
pression formation by computation whenev 
possible. 
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The mutually beneficial effects of college students working as therapeutic agents 


with individuals in a variety 
based primarily on observ: 


students may be useful as therapeutic age 
involved in a helping relationship exhibit 


effected by more traditional psychotherapies. 
vill require further empirical investigation before the wide- 


als in psychotherapeutic settings can be sanctioned. 


unresolved issues W 
spread use of nonprofession 


aah continuing investigation (1959, 
ae E 1968) of professional manpower re- 
E on in the mental health fields has made 
fon EGRE the severe current shortages and, 
PE has suggested the probability of even 
lish i future shortages. It has been estab- 
is is that with present methods and models 
he mental health field it is simply not 
Suse to satisfy the ever increasing social 
mands for professional personnel. 
as Second aspect of the mental health man- 
ent dilemma is that even if there were suf- 
et t numbers of professional personnel, pres- 
c iere health ideology would prevent 
am y of those needing help from receiving it 
pon, Rubinstein, & Speisman, 1969). 
e S fact is most evident in the notorious lack 
P inis mental health professionals have 
“c Working with the poor (Grosser, Henry, 
Kelly, 1969; Pearl & Riessman, 65). 
e groups such as drug abusers, alcoholics, 
prs delinquents have also been x 
Dri ed by the mental health professionals 
marily because professional contact with 
m has been, for the most part, fruitless. 
Es other hand, Blau (1969b) me 
Bte: nonprofessionals appear to have had a 
At deal of success in working with these 
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of mental health 
ation and clinical impressionism su 


settings are reviewed. Studies 
ggest that college 
nts, and, at the same time, students 
personality changes not unlike those 
There is evidence that many 


grams such as Alcoholics Anonymous and 
Synanon (Volkman & Cressey, 1964; Ya- 
blonsky & Dederich, 1963). 

In addition to the success of nonprofes- 
sionals working directly with these individu- 
als, another kind of role for the nonprofes- 
sional may be useful to mental health concep- 
tualization. Reiff (1967) and Pearl and Riess- 
man (1965) have suggested that one of the 
most important uses of nonprofessional per- 
sonnel is a “bridging” function, that is, a link 
between professionals and their target popu- 
lations. In order to be an effective “bridge,” 
the nonprofessional must be indigenous of the 
target population and speak the vernacular of 
both the target population and the profes- 
sional. His interclass communication and 
mediation skills can facilitate communication 
between the two groups, and, thus, he will be 
able to discuss the needs and goals of each 


group with the other. 

Other groups go unt 
tal health professiona 
treating them, but rathe 


choose situations that jore i 
and may prefer to utilize their time working 


with individuals who manifest dramatic 
change more rapidly. Nonprofessionals may 
be able to fill this void by working effectively 
with some of these neglected groups; such as 


retarded children and geriatrics. In this way, 
nonprofessional mental health personnel may 
be the key to providing more complete cov- 
erage in terms of mental health services to 
the entire population. 
More provocative is 
nonprofessionals are m 
professional counterparts 


reated, not because men- 
ls are less successful in 
r because professionals 
are more challenging 


the evidence that some 
ore effective than their 
in working with 
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some populations which are presently receiv- 
ing professional focus. Zunker and Brown 
(1966), for example, found that student 
counselors were more effective in counseling 
college students than were the professional 
counselors. Poser’s (1966) college girls pro- 
duced more positive changes in hospitalized 
chronic schizophrenic patients than did the 
professional staff. If indeed nonprofessionals 
are more effective than professionals in deal- 
ing with certain populations, then part of our 
professional mental health manpower resources 
is being wasted! 

It is necessary to take a new look at tradi- 
tional conceptualizations in mental health, to 
examine the parameters of professional in- 
volvement in social problems, and to set goals 
for the future, All aspects of the manpower 
dilemma must be investigated to insure maxi- 
mum utilization of present resources as well as 
seeking out new Sources of mental workers. 

Attempts at amelioration of the manpower 
dilemma in psychology have included subdoc- 
toral education (Albee, 1968; Arnhoff & Jen- 
kins, 1969; Hoch, Ross, & Winder, 1966; 
Knott, 1969), the examining of new models 
for psychology (Albee, 1968; Hoch et al., 
1966), and the utilization of volunteers or 
nonprofessionals (Christmas, 1966; Holand & 
Voss, 1968; Reding & Goldsmith, 1967; Sa- 
vino & Schlamp, 1967; Wahler, Winkel, Peter- 
son, & Morrison, 1965). 

Volunteers or nonprofessionals with varying 
levels of training, motivation, and socioeco- 
nomic status have been used in a variety of 
situations. Two investigations by Stover and 
Guerney (1967) and Patterson and Brodsky 
(1966) have utilized mothers as therapeutic 
agents for their own children while Rioch, 
Elkes, Flint, Usdansky, Newman, and Silber 
(1963), Nichtern, Donahue, O'Shea, Marans, 
Curtis, and Brody (1964), and Donahue and 
Nichtern (1965) have trained mature women 
to act as therapists for the children of others. 
Hirsch (1968) and Zimmerman and Zimmer- 
man (1962) have shown that teachers can be 
effective as therapeutic agents. Even grand- 
parents and retired individuals have been put 
to work as psychotherapeutic agents (Cowen, 
Leibowitz, & Leibowitz, 1968; Johnston, 
1967). Hartog (1967) has described the work 
of volunteer enlisted men consultants in a 
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United States Army hospital, while Fellows 
and Wolpin (1969) and Perlmutter and Dur- 
ham (1965) have described the use of high 
school teenagers in mental health programs. 
Grosser et al. (1969) have reported on E 
Conference on Nonprofessionals, which dea 
primarily with the use of indigenous person- 
nel in health and welfare fields. 

The present review centers around college 
students who act as volunteers or nonprofes- 
sionals in various mental health facilities. TH 
focus upon college students is relevant no 


ities” 
only because they appear to have qualit 


and characteristics which may suit them " 
work with troubled individuals but also pa 
cause such work appears to have a pase 
developmental influence upon their own p 
sonalities. Because of gross methodologlc" 
deficiencies in most of the studies inven 
college students as therapeutic agents, it 
impressions and this review are based pe 
marily upon conceptual analysis of present ¢ b- 
mental health issues, its logic, and clinical 0 


i er E : ion- 
servation. The justification of nonprofess! | 


als and the issue of college student devel 
ment need more study before widespread U^. 
of college students in mental health setting 
can be sanctioned. The function of this ie 
then, is to identify and structure imports, 
areas which require further empirical inve 
tigation. 


: e 
patients, the colle? 


F a | 
by selecting duties that keep the? 
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Tis 
o m Breage (1962) suggested that the 
Primarily fr udent success with patients results 
ae crusad we three factors. First of all, they 
nding a eni. feel the exhilaration of 
revolution; tei hwhile cause, Further, they are 
Mental illness engaged in a struggle against 
in mental pe tise toll of which can be seen 
altruistic Moe oe Finally, an element of 
observed by "s ty characterized the students 
Mitchell been ge et al. (1962). i 
with dilde 966) posited that in working 
3 particular t; Ree students seem to have 
On world a ent for finding the child in his 
that the c Ip. 311].” He further suggested 
eir | Sewing dress of the college students, 
aneity anil m, and, in particular, their spon- 
Ures Which enthusiast are all disarming fea- 
tole, Th his enhance the effectiveness of their 
lor of colli argument against equating behav- 
Psychother girls in mental hospitals with 
at tulle Rosenbaum (1966) reasoned 
te near DU, particularly young girls, 
ations who 3 because mental hospital popu- 
Will respo “on been rejected by our culture 
E timane to young, vibrant people who 
imilariti and extend warmth. [p. 294]. 
Mental han such as residing in a develop- 
Ay creat, itution and seeking life changes 
and hindi. bond between college students 
Ute à iu es of mental hospitals and consti- 
0 urther basis for the reported success 


diy ege students working with troubled in- 
o uals, Keniston (1967) pointed out that 
patients 


th : 
^ college students and psychotic s 
he 
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ul mon characteristics of which include stim- 
the devel- 
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also sees the locus of control outside himself. 
The professional therapist, on the other hand, 
may appear as though he long since attained 
identity, security, and control of self and thus 
cannot recall the intensity of his own strug- 
gle. If then, as some authors suggest (Rogers, 
1957; Truax & Wargo, 1966), accurate em- 
pathy is the single most important therapist 
characteristic in determining outcome of ther- 
apy, then the college student instantly and 
automatically has an advantage over his pro- 
fessional counterparts via his social status. 
Furthermore, if feeling closer to the therapist 
in terms of social status does facilitate “in- 
stant felt empathy,” then college students 
should be even more effective working with 
other college students, adolescents, and chil- 
dren. 

Because they cannot rely on professional 
training or the professional facade, college 
students are forced to use à naive, common- 
sense approach to their encounter. Perhaps the 
fresh approach of these young people has an 
effect on depressed or disturbed individuals 
that the trained professionals have difficulty 
duplicating. Rogers (1957) has postulated 
that theoretical wisdom is not a necessary in- 
gredient for a helping relationship, and, to 
expand Rogers’ contention, it may be that 
theoretical wisdom is actually detrimental to 
some therapeutic encounters because it re- 
strains and constricts the variety of ap- 
proaches available to the therapist. College 
students, on the other hand, are free of this 
theoretical constraint and are less inhibited in 
trying new approaches. As a result of their 
ignorance they may uncover effective new 
approaches considered inappropriate or too 
illogical by professionals. The present author 
is not advocating disregarding all theory but 
simply suggesting that we may be able to 
learn from the fresh approach of untrained 
college students. 

Reiff and Riessman ( 1965) have postulated 
that greater flexibility in terms of appropriate 
and accepted behaviors on the part of the non- 
professional may be a special asset. Whereas 
a college student may take his charge to the 
x UE. be LR e. 

? : is role prescription, 
would not engage in these activities. Poser's 
(1966) college girls could play “Drop-the- 
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Hanky” with their patients, but the profes- 
sionals in the study would not have dared such 
activities. 

A final advantage of using college students 
as therapeutic agents and a further reason for 
their apparent success particularly in settings 
outside the hospital is that there may be less 
stigma involved for the client, Whereas a 
parent may be concerned about the stigma 
attached to sending his child to a psycholo- 
gist, he may be less inhibited about his child 
seeing a college student given, of course, that 
therapeutic effect is roughly similar. To sum, 
it appears that college students may have an 
advantage in working with some populations 
by virtue of their not having prior training or 
professional status. 

Personality theorists who are particularly 

interested in college student development 
(Madison, 1969; Sanford, 1962) suggest that 
college students have a significant potential 
for change, and there is a continuing search 
for ways in which the personality develop- 
ment can be facilitated. Evidence suggests 
that working part time in a community mental 
health facility may serve as an instrument of 
Personality change. College students who work 
in mental hospitals, psychological clinics, or 
other mental health settings manifest signifi- 
cantly more positive changes in self-acceptance 
and moral judgments in sexual and aggressive 
acts than do control groups (Holzberg, 
Gewirtz, & Ebner, 1964) and also greater 
self-understanding (Reinherz, 1962; Stollak, 
1969; Umbarger et al., 1962). Increased self- 
confidence and enhanced identity formation 
are further personality changes effected by 
Working in a mental health installation 
(Scheibe, 1965; Umbarger et al., 1962). 

Even if college students possessed none of 
the above characteristics which make them 
particularly desirable as therapeutic agents, 
the discovery of a large untapped pool of 
volunteer workers in mental health would be 
welcome news. The suggestion that they do 
have characteristics and qualities which make 
them particularly suited to working in a men- 
tal health setting, as well as evidence that 
working in mental health may foster desirable 
personality changes, makes the topic not only 
more interesting but also worthy of theoreti- 
cal attention and of a review. 


GENE G. GRUVER 


COMPANION PROGRAMS 


Companion Program is a term used to de 
scribe situations in which college students 
spend a certain amount of time each ve 
"companions" to patients in mental hospita i 
Companion Programs may be structured 9 
unstructured, provide training or no a 
provide monetary remuneration or no e 
tary remuneration, but they share the 2" 
mon feature that individuals from the p 
munity are brought into regular face-to- M 
contact with individuals evidencing behav! 

roblems. A | 
; The first Companion Program originated 
in 1954 at Harvard University and provee 
Service to the Metropolitan State Hosp. 
(Umbarger et al., 1962). By 1962, eh E 
had spread to such an extent that a boo Um- 
scribing one program was published 3 
barger et al, 1962), and a conference a 
held to discuss the impact of the progr’ 
(Kantor, 1962). The Companion Pros! 
idea is continuing to expand and is ust 
met with enthusiastic response from stude - 
staff, and patients alike (Spoerl, 1968). Hos 

The program at Metropolitan State m 
pital has been the model for subsequent ue j 
panion Programs, although not all new E 
grams have been so ambitious as the baci 
in which 2,000 students had been in con ais 
with mental patients during the first 7 er 
of operation. At Metropolitan State HOSP pa 
there are four separate areas in which x 
students might choose to work. Each situat! 


going for walks a 
rmed by volunte? 


be 

1 d fourth work area was initiated. 
in E ut a halfway house for patients 
outside com etween the hospital and the 
grams at oh of these four pro- 
loosely NS letropolitan State Hospital are 
orm Spleen provide training in the 
uting the uh meetings held once a week 
and (d m students are entering services; 

Usine we monetary remuneration. 
Program at a er measure one chooses, the 
een an ba Mettopaütan State Hospital has 
Patients Ang helming success. The effects on 
been, for re student volunteers alike have 
| Success on e most part, positive. Measures of 
Were sub: the ward improvement projects 
jective and consisted of the students' 
Di Adult 
of rea = also included the recording 
. Portant E T hi were dee. e a 
ne of (ins, dur in patient group ehavior. 
se landmarks, for example, was the 


acco: 
un m = ? 
nit ts, which were written in the 


oar ol doors between the women's “vio- 
ie s b and its male counterpart. , 
easure of success in the case-aide pro- 
not only more objective but more 
MS l4 ps By the end of the first year, 11 of 
^ aide Faroe who had been visited by case- 
Ospita] un had been released from the 
| g vith a readmission rate of less than 


Stam js 
Startling, 


Eo Improvements in the other three pa- 
$ could be measured bY their transfers 
Boa Secure wards and by 9 
€ achievements. i 
E a follow-up study of the chronic psy- 
c patients seen by college case-aide ue - 
Ou ers, Beck, Kantor, and Gelineau (1005) 
h nd that of the 120 case-aide patients who 
31% been seen from 1954 to 1961, 3r 
wit) had left the hospital while wo 
of rami o Of these 37; 
n Jo hospital at the tim 
Mies eee of 3.4 year 
ee Seven more P 
al a few months after 
were all out at follo 
E 12 years each. The fn ing ghat S leave a 
Oup of psychotics dn NUR ere 
Chronic service atter «treatment s ie 
tude grvice t the assertion at 
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As has been suggested, the measures of 
success of the program at Metropolitan State 
Hospital include not only the effects upon the 
patients but also the effects upon the stu- 
dent volunteers. All students in the program 
‘that they learned a great deal from 
Many felt that 
tient and the in- 


claimed ‘ 
the case-aide experience.” 
their relationship with a pa 
struction of the group leader had taught them 
more about psychological theory and mental 
illness than had their courses at college. Some 
became interested in careers in mental health 
work. Moreover, “all claimed that they had 
gained insight into their own personalities 
and problems through their relationships with 
the patients and their own group [Umbarger 


et al., 1962, p- 54]." 

Erikson (1959) suggested that the crystal- 
lization of professional goals is a major phase 
of the process identity formation. If this is 
true, then. work in the case-aide section of 
the program can be said to have facilitated 
identity formation since Kantor (1959) and 
Greenblatt and Kantor (1962b) have shown 
that more than 7076 of the students who were 
indefinite or undecided about career choices 
before participating in the case-aide program 
made concrete choices in the direction of 
mental health. In evaluating Kantor's find- 
ings, it should be noted that no control groups 
were used, and, moreover, students! career 
decisions during college years tend to be un- 
stable. However, Kantor's essential conclu- 
sion that the project influenced the career 
choice of participants in the direction of men- 
tal health is probably valid. 

As in most new areas of research, objec- 
tive measures of college student success with 
patients are scarce; however, a large body of 
relatively objective research, particularly con- 
cerning the student development aspect, has 
come out of the Connecticut Valley Compan- 
ion Program, which was modeled after the one 
at Metropolitan State. In one of the studies 
(Holzberg & Gewirtz, 1963), a group of stu- 
EET e i the companionship 
of students who viliisered f contol group 
service activities such as YM an orie a 
Brothers. On a question: a bean 

zs : naire that was ad- 
ministered to both groups at th ee 
of the academic yea the beginning 

r and again at the con- 
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clusion of that year, the subjects in the Com- 
panion Program shifted significantly in a 
positive direction in terms of their attitudes 
toward and knowledge of mental illness. 

In another study at Connecticut Valley 
Hospital, a questionnaire measure yielded 
data suggesting positive effects on both pa- 
tients and students (Holzberg, Whiting, & 
Lowy, 1964). Eighty-four percent of the pa- 
tients said they enjoyed the relationship with 
the students, while the students reported that 
71% of the patients showed improvement 
over the year. Ninety-one percent of the stu- 
dents themselves reported they became less 
anxious about working in a mental hospital, 
9055 reported a greater understanding of 
mental illness, 84% suggested feelings about 
mental hospital personnel had changed, and 
97% of the students considered that their 
experiences had contributed to their personal 
growth. In another study (Holzberg et al., 
1964), the effects of association with hos- 
pitalized mental patients on the personalities 
of 32 male college students were compared 
to a control group of 24 students who had 
not been involved with mental patients. Stu- 
dents in the Companion Program demon- 
strated significantly positive change in self-ac- 
ceptance and in moral judgments concerning 
sexual and aggressive behaviors. Holzberg and 
Knapp (1965) have presented further evi- 
dence of positive effects upon Companions in 
their findings that after serving as Compan- 
ions they are less frequently on academic 
probation and that they increase their intro- 
spective behavior. 

More recently Holzberg, Knapp, and Tur- 
ner (1967) have collected psychological test 
data so as to compare patients in the Com- 
panion Program with a group of control 
patients who were not. A relatively small 
sample of 13 Companion patients and 30 con- 
trols completed the Minnesota Multiphasic 
Personality Inventory (MMPI), Bender Ge- 
staldt Designs, and Draw-a-Person before and 
after the program, Companion patients 
changed significantly on the Depression scale 
of the MMPI from pretest to posttest, and 
a similar difference on the Paranoid scale 
barely missed significance, The investigators 
are still studying data on the Bender Ge- 
staldt Designs and Draw-a-Person to see if 
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significant changes occurred in Companion 
patients on those instruments. L 
One may argue that college students VO 

unteering for service as a Companion appear 
to change to a greater degree because they 
are more pathological at the time they 
volunteer than the average college student. 
Knapp and Holzberg (1964) have shown this 
not to be the case. A group of 85 male col- 
lege students volunteering for service as Cora 
panions to chronically ill mental patients Me 
compared to a group of 85 control studen 

on a number of psychological tests adminis- 


i . sear. 
tered during the students’ freshman Ye | 


Tests included MMPI, Edwards Persona 
Preference Schedule, Allport-Vernon-Lind2ey 
Scale of Values, the Scholastic Aptitude Tesi 
and the Terman Concept Mastery Test. S 
dents taking part in the Companion Progr 
were shown not to be different from b». 
control counterparts in any significant v 
cal respect, but there is evidence that L. 
Companions are (a) slightly more religiously 
oriented, (5) more morally concerned, 1 
more compassionate, and (d) more de 
verted than the control students. In a a, 
study, volunteers were also differentiated ei 
controls on academic variables such as en 
major area of study, frequency of disciplina! 
action, and fraternity affiliations (Holzber® 
et al., 1967). ich 
Scheibe (1965) described a program wr 
is similar to the Companion Program mo rk 
except that students were assigned to PU. 
for a continuous 8-week period rather m 
once-a-week appearances over a year's a 
as required in the Companion Program. E 
dents in the Service Corps Program of. 
State of Connecticut lived at the hospital n 
spent a normal working week with chron 
patients for which they received $200 ug 
for the 2-month period. Students were not p 
signed to a specific patient but worked w 
all the patients on the ward in unstructu" 
activities, Positive changes in the stude?" 
description of the typical mental patient p* 
noted on Gough's Adjective Check List gv e 
at the beginning and at the end of the W^ 
period. In describing themselves on the kw 
Adjective Check List, students exhibited we 
nificant gains in Achievement, Domina se 
Self-Confidence, and Nurturance. There © 


Epod adverse effects on the college stu- 
(s as a result of working with the mental 
E cist rede p were substantiated in that 
3 ysta ization of vocational goals appeared 
a direction favorable to mental health. 
Seer si Kulik, and Scheibe (1969) subse- 
ently published a more detailed study oí 
yw characteristics of college volunteers 
| fifty. Service Corps Program. One hundred 
Em | students serving in the Connecticut 
imme Corps and 142 controls enrolled in 
bus si school at four Connecticut colleges 
E a battery of tests and ques- 
E including California Psychological 
Ie eed Gough Adjective Check List, Rot- 
Cres er nal-External Small Scale, Marlowe- 
oed Social Desirability Scale, the Strong 
re e oig e Blank, and a biographical 
ko mig The striking personal charac- 
Slants of the college student volunteers were 
Stree and control, drive for independent 
doe cy and sensitivity to distressed in- 
Bek : On the Strong Vocational Interest 
porte uod interests were similar to those in 
biogra E emphasizing social service. Auto- 
college hical data further indicated that the 
Orient baa a volunteers were more service 
ACA and more dedicated to mental health 
none The authors concluded that. data 
volun ed here suggest that participation in 
cern teer work is not motivated by overcon- 
Dart] with personal problems but rather is 
indep, attributable to a controlled drive for 
quie achievement and sensitivity to 
tan problems [p. 34]." 
Puer and Lipton (1963) reported on a 
ix "e at Morristown State Hospital wherein 
No » lege students were employed full time: 
Instructed © was given, but the eee = 
ing hi ed to devote all their efforts to ct E: 
vid, 1 Enly personal relationships with indi- 
Orts patients. Based upon subjective re- 
Oteq of the students, positive changes were 
in individual patients. 
th vine (1966) reported an ir 
weed anges in attitude and behavior pro- 
5 in students by a nonacademic, off-cam- 
Program which he suggested appeals to 


nq p 
ig, Diis to work the unenacted idealism of 
acr S college youth. Recreational and social 

ents fostered 


Civit: : 
ities with the mental pat! 


h 
„vestigation of 
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more positive attitudes toward and increased 
interest in social action. 

] Poser’s (1966) now classic study was de- 
signed to compare the effect of professional 
and untrained therapists. The untrained ther- 
apists were 11 college girls as well as two 
inpatients—none of whom had training in 
psychology. The professional therapists. in- 
cluded psychiatrists, psychiatric social work- 
ers, and occupational therapists. Poser di- 
vided 343 male chronic schizophrenic patients 
into groups of 10 with each group matched 
as closely as possible with every other unit 
in terms of patient age, severity of illness. 
and length of hospitalization, Each of the 
groups were then assigned to a therapist 
picked at random. Each therapist met with 
his or her group for 1 hour each day, 5 days 
a week, over a period of 5 months. Both the 
trained and untrained therapists were free to 
conduct their therapy sessions as they wished. 
Results were interpreted using the difference 
in pretherapy and posttherapy Scores on six 
different psychological tests. It was found 
that the college students achieved slightly 
better results than the professional mental 
health workers doing group therapy with 


similar patients. 

Walker, Wolpin, and Fellows (1967) de- 
scribed a program which was a joint venture 
between Westmont College, Santa Barbara, 
and Camarillo State Hospital, Camarillo, 
California. Students received college credit for 
research and service activities involving di- 
rect contact with patients. Using a modified 
sentence-completion test for the students and 
subjective reports 0 


f the patients, they con- 
cluded that “we may 


be able to foster better 
personal developments as we 


]l as enrich school 
and college curricula while developing poten- 
tial interest and entrance 


into the mental 
health field [p- 188].” 
ted on students from 


Spoerl (1968) repor 
the University of Washington who volun- 


teered to work on the psychiatric service of the 
University Hospital. In order to capitalize on 
the student spontaneity and imagination, no 
instructions were given except that the volun- 
teers were to begin their relationship with 
patients On a peer basis. Volunteers could 
work with either individuals or groups. No 
objective evaluation of the student volunteer 
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project on the patients is available, but ques- 
tionnaires were given the ward staff and pa- 
tients, and written reports were solicited from 
the volunteers. Most of the comments on the 
questionnaires and written reports were favor- 
able to such an extent that the program has 
been enlarged to involve more volunteers and 
more patients. 

Cowen (19692) discussed an innovation in 
which college students are enrolled in a year- 
long seminar practicum in mental health: The 
first 2 months are used in examining current 
and past community mental health programs, 
and then the students, working in groups of six 
to eight each with a graduate student leader, 
begin service in one of three community set- 
tings: (a) a state hospital in which they en- 
courage verbal and nonverbal interaction with 
chronic schizophrenic patients, (5) a school 
setting working with young students experi- 
encing school adjustment problems, and (c) a 
settlement house with very young children 


who are exhibiting very slow social verbal 
development. 


COLLEGE STUDENTS WITH CHILDREN 


Many of the Companion Programs have 
special Children’s Units, but, since children 
appear to be more malleable and become much 
more intensely involved with the student 
companion, volunteers for the Children’s 
Units are usually subjected to a far greater 
amount of control than those working with 
adults. College students often prefer to work 
with children for a number of reasons. First, 
improvement in the younger patients is more 
easily observed even by naive volunteers. Also, 
students discover that in just a short time the 
children begin to respond positively to college 
students. Umbarger et al. (1962) reported 
that students working with the children felt 
less anxious about their own identity and more 
successful in their work than they did with 
the older patients. Students were apparently 
much more effective because they could act 
in a more relaxed and normal manner. Fur- 
ther, socially validated roles of big brother 
and big sister worked extremely well with die 
children while no such role was readily avail- 
able with the adult patients. oer 

Kreitzer (1969) used psychology majors 
who received course credit for their work with 
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hospitalized emotionally disturbed children. 
Training consisted of completion of course 
work in psychopathology as well as a weekly 
2-hour group supervision session. Measures 
included staff member ratings and teachet- 
rated changes in, or elimination of, inappt™ 
priate behavior. Many of the target behavicM 
were reduced or eliminated, and some of E 
student therapists involved called this tP 
"highlight of their college experience." i 

Reinherz (1964) reported a project ! 
which students from Radcliffe and Harel | 
volunteered for work at Massachusetts M. 
Hospital working with children who were 51 1 
patients and who did not have severe betia 
ioral problems. Volunteer college ace 
spent one afternoon a week with the chil 
after having met with a social work supervis 
for 15 minutes ahead of time to pu ) 
progress report of the child. During the E 
year of the program, ward psychiatrist, é 
ported improved functioning in three OU ed 
the four children in the program. In the E. 
ond year, physicians reported change oa 
progress in all seven patients, and, in se | 
cases, psychological tests confirmed pos! a 
growth. At the end of the second year, bite. 
the seven patients were ready for dischar? 
and a third had gone home on extended yo 
Earlier, Reinherz (1962) had observed | 3 
some of the successes that college nd 
have in working with emotionally distor 
children may be due to their having rece r8] 
solved or left unsolved basic issues of qo 
tion in their own lives. She noted that in olm 
adolescence identity problems such as seX E j 
and career choice are important developmen s 
issues, and their successful resolution d 
the difference between a productive and m 
productive adult role. Often it was obser? 
that as the student aided the child in work 
out the problems of self-maturity, the stude" | 
too, appeared to be gaining a definitive 59 
tion for himself. 

Fellows and Wolpin (1969) reported 
project using teenage Psychology trainees, 
a mental hospital. Four academically super! 
boys spent 4 days per week working with ad? 
lescent and preadolescent boys in a ment? 
hospital. They were paid s 
2 months. Following a 
the hospital, the trai 


a 
jn 


Cou s. 
B uum the adolescent male unit engag- 
iline ero conditioning procedures and con- 
own EA therapy. Students were on their 
iscussion sm 1 hour per day of permissive 
Ment in the vs n: supervisors. Measure- 
With the dre was primarily subjective, 
Notable effect hors reporting a particularly 
| boys idealized bom the patients in that the 

Whatever s E the trainees and sought to copy 

Dorteq ne e did. More objectively, it is re- 

i at a number of boys from this par- 


tic 

ilar ¢ F 

or igang “have been placed in foster 
j P homes where they seem to be ad- 


. Justing w 
ke having learned some aspects of 
Alot avior from the trainees [p. 276]-" 
Ubon Studer e the emphasis has been placed 
ulation in hts working with an inpatient pop- 
Programs mental hospitals, but many day 
Ihitiateq. or outpatient programs have been 
elz, d mien] in caring for children. 
Sested th; mel, and Silvertsen (1967) sug- 
. Rredient at volunteers are an essential in- 
. Cility DEAS ongoing functioning of a fa- 
| of children ‘ated to the psychiatric treatment 

| r 

dents caa, (1967) has shown that college stu- 
Pandeq m e used as helpers in providing ex- 
polit rud health services for children. 
children ted out that the attrition rate of 
in guidance clinics has been found to 


e an d 
Mi, d 60% (Tuckman & Lavell, 1959). 
, Qugh companionship may not replace cer- 

vices, it does 


tain g; 
rec gnostic and treatment ser t 

Vill use a service that parents and children 
attrition He based his suggestion on the 1 
di Nin rate of 12% for this particular stu ly 
mast he used college students as ig 
a: prs He concluded that college students 
Y facilitate and enhance treatment of chil- 


r s 
en with psychiatric problems. E 
ò 9odman (1967) has experimented with 
ep Panionship therapy between collegs ste 
We. ts and troubled boys- Male college P n 
Sh re trained in a 24-day experimental po > 
ED and yere paid $1.40 per hour. Mter 
FOunselors were selected, they were divided 
to a “quiet” group an “outgoing” group, 


hh "et counselors being paired 
With alf the quiet quiet problems, that is, 


boy : ing 

: ys evidencing i 

Social oen rhe other half of the guet 

Ounselors were paire with boys having out- 

oin were | the same procedure was 
g problems, a? 
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followed for the group of outgoing counselors. 
Although only tentative findings are available, 
results suggest that boys with social intro- 
version problems gain most from participat- 
ing in the program. As in the Companion Pro- 
Goodman noted that his students mani- 
ality changes not unlike those 
Holzberg (1963). Goodman’s 
a dramatic increase of in- 
ldren and in work- 


grams, 
fested person: 
reported by 
counselors showed 
terest in the behavior of chi 
ing with troubled people, and they reported 


srovement occurred in the way they 


that imp 
interact with friends. Differences between 


counselors and matched controls who did not 
participate in the Companion Program are 
significant. Nonparticipating controls were 
used so as to determine the effects on the 


counselor in working with children. 
Mitchell (1966) has coined the term “ami- 


catherapy" which he described *as a form of 
therapeutic intervention whereby layman vol- 
unteers relate in sustained friendship roles to 
troubled and disturbed persons under the 
guidance and supervision of professionals [p. 
307].” He suggested that there may be ad- 
vantages to using college students in amica- 
therapy since college students seem to have a 
particular knack for meeting the child in his 
own world. He postulated that this may be 
the case since students are not yet irrevocably 
committed to the adult world. After a student 
is chosen to work with a particular child, he 
sees a supervisor for an orientation interview 
and is instructed simply to make friends with 
the child. Clinical investigations of 74 chil- 
dren who participated in Mitchell's program 
have indicated “all of the children have bene- 
fited by their relationship with the student 


volunteers [P- 314]." 
d Linden and Stollak 


Stollak (1969) an 
(1969) have investigated the possibility of 
training college students as play therapists. In 
the former study, the students’ role is modeled 
as closely as possible to that of a client-cen- 
tered play therapist. The basic task is to be 
empathetic, understanding, nondirective, and 
to convey this understanding and acceptance 
to the child. Students were trained in 10 ses- 
sions during which they observed play therapy 
techniques and played with normal children. 
At the end of the tenth session, each student 
was assigned a child between the ages of 4 
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and 10 who were taken from the waiting list 
of the Lansing Child Guidance Clinic or the 
Psychology Clinic at Michigan State Univer- 
sity. Stollak (1969) noted that undergradu- 
ates do significantly change their behavior 
during the sessions by increasing their reflec- 
tion of content and clarification of feeling 
statements. Linden and Stollak (1969) con- 
cluded that communicated empathy is not an 
innate ability but must be taught. This has 
an important implication for the utilization of 
college students in mental health settings. Tf 
one adheres to the client-centered tenet that 
communication of accurate empathy is a nec- 
essary prerequisite for therapeutic movement, 
the turning loose of naive, untrained college 
students on a mental hospital is not as effec- 
tive in producing therapeutic change as the 
same students might be if they were first 
taught to communicate the empathy they feel, 
that is, to make appropriate verbal statements. 

Davison (1966) trained undergraduates as 
Social reinforcers for autistic children at a 
private day care center; however, the results 
Were less than significant when kind and de- 
gree of behavior change were used as the cri- 
teria for success. Less than 15 hours of the 
child's waking hours were spent in the pro- 
gram, and, since learning theory depends on 
environmental contingencies for factors that 
shape and maintain behavior, these results 
were not unexpected. One important finding 
which did emerge from this study, however, is 
that intelligent, motivated college students can 
be trained in a very short time to carry out a 
behavior control program that applies learn- 
ing principles to the manipulation of psychotic 
behavior in children. 


COLLEGE STUDENTS AS THERAPEUTIC AGENTS 
IN RELATED AREAS 


Heretofore this study has focused upon col- 
lege students as therapeutic agents in settings 
traditionally within the realm of men ke 
health, that is, mental hospitals and psycho- 
logical clinics. In recent years, however, men- 
tal health professionals have come to -— 
other settings as legitimate surrounds for 
mental health workers, Cowen has headed a 
number of research efforts seeking possentinm 
of emotional disorders in the school setting 
(Cowen, 1968; Cowen, 1969b; Cowen, Car- 


GENE G. GRUVER 


lisle, & Kaufman, 1969; Cowen, Izzo, MM ; 
Telschow, Trost, & Zax, 1963; Cowen, 2 
Izzo, & Trost, 1966; Cowen, Zax, & Tai 
1966; Zax & Cowen, 1967). Cowen, Zax, E. 
Laird (1966) selected 17 undergraduate a 
unteers to provide emotionally disturbed C 
dren with a meaningful relationship by pa 
them with active, enthusiastic college ^ 
dents. Student volunteers had no training um] 
were encouraged to foster a spontaneous, T" 
friendship with the child. For a varie w 
assumed reasons, none of the outcome e 
sures discriminated between the expert A j 
group of emotionally disturbed children. s 
tainly the short time the program Mt. 
effect (2 months) limited the number of oa 
tacts. Another reason there may have ws 
significant difference between the two s 
is that the control group was simultane? 
engaged in another program aimed at a 
venting emotional disturbance, There now 
significant changes among the Volunteers a 
ever, in that institutional concepts wer e 
longer rated in a stereotypically positive ^ aa 
Further, volunteers rated youngsters bat 
encing emotional difficulties in a more porie 
and accepting manner on a semantic pm 
tial It will be recalled that similar m 
toward more positive attitudes toward m tici- 
illness have been found to result from d & 
pation in Companion Programs (Holzber* 
Gewirtz, 1963). ared 
In another study, Cowen (1968) compa 
the effectiveness of two interventive prog «d 
—one using housewives, the other using a 
lege students. Both groups of volunt nd 
worked with children experiencing emit” d 
difficulties in a school setting. Two indeP 0 
dent rating-scale evaluations were used " 
measure changes in behavior of the yet 
Sters. Mean improvement scores of the E) 
experimental groups combined were sign 
cantly greater than those of noncounse 
control groups; however, only the group B 
by the housewives was rated as significant 1 
improved. Children seen py the college St" 
dents were directionally better than control 
but the difference was not significant. Cow” 
postulated several reasons for this different” 
First of all, the housewives were carefull* 
screened, with the experimenter selecting 07 
those applicants with attributes facilitati”? 
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inti á 
i ooa iopeactions with young chil- 
only on — student selection was based 
inated Te nS UDE screening which elim- 
" na ro obviously unsuited. , Also, 
Schools Tes mm 2 years experience in the 
Previous im eas the college students had no 
| new BN queni Further, the housewives 
erally better personnel better and were gen- 
| Also, the s Hi i in the school setting. 
exible Pier edules of housewives were more 
umber "y they were available at a greater 
ents, The times than were the college stu- 
| Personnel e subjective impressions of school 
were that these facts stacked the 


Cards ; 

S a 

In favor of the housewives. 
) 


J: McV. Hunt (1969) discussed a model for 
the Hall-Nebraska 
] in a 
h vari- 


quchology he called 
(P where students are involvec 

selor-counselee” relationship witl 
ho exhibit a variety of 
y. Under- 
llege pals 
milies, chil- 


Ous ki 
Kinds 
D ds 


high school 
delinquents. Under- 


ablished an ongoing 
ial in one of the 


and juvenile 
| te Cii college pals est 
un with an individu r = 
Conta, entioned categories and continued is 
Coun, ct throughout the school year. When the 
Eur left school, he introduced to his 
Poste a new counselor and observed and 
a Uraged the new relationship. The college 
i qussociation has been particularly effective 
e family project. This program deals with 
ching ldren—7 in each of three families, each 
tro having a “college pal.” There is no con- 
Soup other than the children of other 
milies in the neighborhood. Children from 
© neighborhoods of these three families sel- 
P complete high school, and one criterion 
the success of the college pal project was 
a, Umber of the children in the project who 
this, completed high school. All students in 
Dle L ect who are old enough to have com- 
eted high school have done s0- Further, all 
othe had at least a try at college. AS with 
er projects of this kind, a second measure 
ect upon the college stu- 


d Success is the effi 
ent themselves. Hunt reported that not only 
“oes this type of project keep counselors from 


“topping out of college; but also the students 


121 


are learning about human relationships by 
dealing directly with people who are having 
problems in the community. Moreover, they 
appear to be learning an amazing amount 
about research methodology, including the 
vernacular. j 

The Department of Health, Education, and 
Welfare (1966) has noted that students rep- 
resent a large and relatively untapped source 
of volunteer manpower for institutions hous- 
ing juvenile delinquents. It is suggested that 
college students may more easily break the 
psychological barriers that often separate 


juvenile delinquents from adult workers since 


alienated young people usually trust another 


young person more than they trust adults. In 
the probation program at Boulder, Colorado, 
college students are presently serving as tu- 
tors, interviewers, and discussion leaders for 
the delinquents. 

Gorlich (1967) further postulated that the 
functions of student volunteers in institutions 
for delinquents are threefold. First, they prove 
to the delinquent that someone on the outside 
really cares about them. Also, college students 
provide the young person with a role model. 
Finally, the students can later help spread the 
word about institutional needs. 

Zunker and Brown (1966) supported the 
idea that youth-to-youth counseling may be 
ore effective than adult-to-youth counseling. 
A sample of 106 college freshman, half males 


and half females, received 61 hours of aca- 


demic adjustment guidance from same-sexed 


professional counselors. Trained under identi- 
cal conditions, upperclassman student coun- 
selors gave equivalent guidance to all other 
beginning freshmen at Southwestern State 
Texas College. Student counselors also used 
identical guidance materials, followed identical 
counseling activity sequences, and were pro- 
vided facilities equivalent to those used by 
the professional counselors. A matching sample 
of 80 men with 80 women was subsequently 
drawn from the freshmen receiving student- 
to-student counseling. Student counselors 
were found to be as effective as professional 
counselors 0n all criteria of counseling effec- 
tiveness; in fact, student counselors achieved 
significantly better results than did the pro- 
fessional counselors on the majority of varia- 
ples used to measure the outcome of counsel- 


mt 
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ing. Furthermore, student counselors received 
& greater degree of acceptance from the 
counselees, and their counselees made signifi- 
cantly greater use of the information received 
during counseling as reflected by first semester 
grade point averages and residual study hab- 
its. 

This review does not exhaust or should it 
limit consideration of the settings or situa- 
tions in which college students can be used as 
therapeutic agents, Programs such as suicide 
prevention centers, groups for children of drug 
addicts or alcoholics, college sensitivity groups 
tutoring underprivileged children, teaching 
delinquent girls good grooming, visitations to 
the geriatric homes, and recreation therapy in 
schools for retarded children are all seeking 
motivated, sensitive individuals with inno- 


vative approaches who can fill their manpower 
need, 


PROBLEMS OF UsInc STUDENT VOLUNTEERS 


Not all of the literature concerning the use 
of student volunteers in the mental health field 
is positive, Umbarger et al. (1962) discussed 
the resistance of the mental hospital’s ward 
staff to the student volunteer and attributed 
much of the resistance to staff jealousy; that 
is, students have time and Opportunity for 
personal contact with the patients, whereas 
staff is kept busy with maintenance. An article 
in the Pennsylvania Psychiatric Quarterly 
(Volunteer Coordinators Resources, 1966) 
Suggested that resistance to the volunteers 
Occurs because of three reasons: (a) lack of 
information on the part of ward personnel, 
(6) insecurity on the part of the staff, and 
(c) the image of the volunteer as a nuisance 
rather than an intricate part of the program. 

Haun (1968) raised a strong objection to 
the establishment of “lay administered ghet- 
tos for chronically ill,” but Rioch (1966) 
Suggested a flexible attitude and suggested 
that psychologists and psychiatrists are not 
exempt from feelings of jealousy. 
If we have invested long years of hard work Es 
achieving a high professional status, including on 
courses that were dull and many examinations ome 
were nerve wracking, and we are told UN job 
young bit of a girl with no training can 2 eat ie 
as well as or better than we can, it is natura’ 


E s 966, p. 
should try to find some objections [Rioch, 1966, p 
291]. 
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Reiff (1967) discussed the dangers E. 
power struggle between professionals and Es. 
professionals and the tensions surroun "i 
such struggles. Perhaps the power struggle va 
already begun as the problem of controls Sa 
nonprofessionals is considered by be 
(19693). He suggested that profess ona 
should begin now to ascertain kinds of F 
trol over nonprofessionals which can the 
should be established. He suggested that M. 
controls would be not only to ensure prO aii 
tion of the service recipient but also to ems 
the effectiveness of the service rendered. 

Another practical problem concerning uti 
use of college students as psychotherape 
agents has to do with the kind and pro 
training needed prior to student-patient ofes 
tact. The question of training of nonp 
sionals is a topic worthy of a separate E E 
and numerous experimenters have ed 
the problem (Berenson, Carkhuff, & N Yay 
1966; Carkhuff, 1968; Felzer, Bullock, ^ & 
& Wolf, 1968; Hart, Kroll, Berkowitn 
Woolcock, 1968; Linden & Stollak, 1 pr 
Parsons & Parker, 1968; Sines, Silver, & 
cero, 1961; Truax, 1967; Vesprani, 1969). J 

While their naiveté has previously ing 
presented as a potential advantage ot ai 
college students, it also is possible that ! ent 
be a liability. The unsophisticated nn pv 
could project his own difficulties onto hi ow 
tients or burden the patient with ^ arn 
personal problems. Without intending ? ay" 
the college student may be tempted to t. 
at psychotherapy by asking personal e 
tions and attempting to interpret the patie Jl 
verbalizations. An additional potential pít 
of using college students in the mental ÉÁ€- 
setting is that they tend to be temporary P a 
ticipants who have a peak experience Ld 
Setting without future responsibility. S 
they are not forced to realize or continue W ] 
the problems encountered by profession’ e 
they may not feel the responsibility for al 
long-term welfare of the patient. A furth 
Possibility is that because they have nothi”? 
to lose, that is, professiona] status, they ma 
exploit the position or relationship with th 
patient to satisfy their own needs, 

There is also concern with the Hawthorn 
effect. There is no way of knowing to whe 
degree the student’s enthusiasm and dedic? 


the 
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tion i b 
M ee of newness to the situation 
that the E essional attention. It is possible 
the En NSHE institutional climate of 
vigor after MAN SENVE tò dampen the student’s 
efficacy 2d a e months. Not only the total 
efficacy at a students but their relative 
Periods of ti the beginning and at varying 
found the cue should be investigated. If it is 
their auf that the college student agents lose 
time, the ese after a certain period of 
given pro, seni student is involved ina 
the mE va could be limited, thus giving 
riences of SUR advantage of only peak expe- 
Oped, h te student therapeutic agent. It is 
Strengths owever, that the freshness and 
many of d the college student will offset 

eir utiliz € inevitable difficulties created by 
“The ere As Cowen (1967) reflected: 
Suffered und health movement has probably 
rom its more from its errors of omission than 
be well errors of commission, and we would 
the jus, MA to begin to correct some of 
3 ner, even at risk of the latter [p. 


1].» 


SUMMARY 


ri dbsitous findings of this review are 
Students hag few studies dealing i 
Most of Qs therapeutic agents, anc r ae er, 
condu ose investigations which have een 
ucted are so methodologically inadequate 

a E it is impossible to draw firm conclusions 
Ut the relative effectiveness of college stu- 


d 
ents as therapeutic agents. 
d € studies are plagued 
bia which are charac 
les th studies in general anc 0 
the f particular. But even more 
act that few of the investigat 


[: 
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lations of the studies are extremely diverse; 
for. instance, Poser (1966) used chronic 
schizophrenics, Umbarger et al. (1962) used 
“chronic psychotics.” Spoerl’s (1968) popula- 
tion included hospitalized college students 
while Goodman’s (1967) students worked 
with troubled boys. Stollak’s (1969) play 
therapists worked with children of unstated 
diagnoses from Michigan State Psychological 
Clinic. Also among the studies there is little 
consistency as to the kind or the amount of 
training. College students in some studies were 
given no training (Ralph, 1968; Spoerl, 1968), 
while others were given very specific training 
(Linden & Stollak, 1969; Stollak, 1969; 
Zunker & Brown, 1966). Further, there was a 
great deal of difference in motivation among 
the students. Some students received monetary 
remuneration (Goodman, 1967; Poser, 1966; 
Scheibe, 1965); others received college credit 
(Umbarger et al., 1962); and still others re- 
ed no extrinsic remuneration (Levine, 
1966; Spoerl, 1968). There were also differ- 
ences in the duration and frequency of stu- 
dents working. Some students worked one day 
a week (Spoerl, 1968; Umbarger et al., 1962) 
while others worked full time (Lawton & 
Lipton, 1963; Poser, 1966). 

Although there are not enough data from 
well-controlled studies to warrant conclusions 
concerning the relative efficacy of college stu- 
dents as therapeutic agents to patients, there 
is sufficient evidence to conclude that the 
therapeutic relationship has a definite, positive 
effect upon the college student therapist. The 
studies supporting this conclusion are much 
more methodologically complete than those 
describing the effect of the student therapist 
on patients; for instance, more than 60% of 
the former studies utilized control groups of 
students who did not participate in mental 
health programs. Further, over 90% of the 
studies describing the effects upon student 
therapists as a result of working in mental 
health settings used pre- and posttests which 
were relatively objective. 

: Personality changes such as positive changes 
in self-acceptance and moral judgments of a 
sexual and aggressive nature, greater self- 
understanding, increased seli-confidence, and 
enhanced identity formation have been noted 
by experimenters in college students partici- 


ceiv 


PA 


pating in mental health programs. It is felt 
that these studies are valid and that working 
in mental health programs may foster person- 
ality development in students in college. It is 
expected that college students are not unique 
in manifesting these changes, but most studies 
of nonprofessionals in mental health, other 
than college students, have not focused on 
these personality changes. One explanation is 
that personality changes in noncollege student 
volunteers participating in a mental health 
program occur less frequently or are less in- 
tense than changes in student volunteers, Tt 
may be that college students are more open to 
change, since goals and interests have not yet 
crystallized. 

Attempts to conceptualize and explain the 
personality changes which are not unlike those 
reported in psychotherapy have been under- 
taken by Holzberg, Knapp, and Turner 
(1966). They have presented four formula- 
tions. Their first formulation views the person- 
ality changes in the students as a function of 
a heightened sense of self-competence, while a 
second hypothesis suggests that changes occur 
as a function of an enlarged and more flexible 
System of personality constructs. The clarifica- 
tion of their self-concepts and identities is 
also seen as a basis for personality change. 
The fourth formulation suggests an “emo- 
tional catharsis.” One of the interpretations 
presented of how catharsis takes place is that 
“certain emotions may be extinguished by sat- 
uration, just as certain persistent habits may 
be overcome through excessive practice [p. 
404].” 

Reiff and Riessman (1965) hypothesized 
that added responsibility for a meaningful job, 
as well as the satisfaction garnered from ac- 
quiring new skills, may account for positive 
personality changes in some nonprofessionals. 
There also may be an increase in status and 
prestige as a result of working with profes- 
sionals which is incorporated into their self- 
concept. Furthermore, one may gain added 
stability and security by reasoning that one 
must be in good shape himself in order to be 
able to help others. , 

A final hypothesis to explain positive per- 
sonality changes occurring in nonprofessionals 
is an enhanced self-concept from the knowl- 
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edge that he can influence the life of others. 
The fact that he can effect positive change in 
others may be particularly important to the 
college student who often feels like a cont 
puter card in a multi-university. It should be 
pointed out that these formulations are pre- 
liminary and have value only if they can be 
stated operationally and submitted to à di- 
rect test under controlled conditions. 

It is felt that college student volunteer prO" 
grams in mental health serve a variety 
functions, including the providing of meaning 
ful educational experiences for college d. 
dents, alleviating the manpower squeeze E 
mental health, enhancing student developme? 
programs, preparing future generations E 
American leaders for constructive roles 1 
mental health, and last, but particularly t 
portant for the present, is the providing 0. 
students with an opportunity to effect Te 
and meaningful changes in their environmen 
—a constructive cause to which they can 
vote their energies. nt 

A number of problems face college stude 1 
volunteer programs in mental health. Accord 
tance by staff and even by professionals lY 
improved, but resistance to college studen 1 
acting as therapeutic agents still exists. ie 
other very closely related problem is the be 
nition of interaction appropriate to nonpro a 
sional college student therapeutic agents. A 
a third problem area for the college stude 
mental health program is the question of 
kind and amount of training. Most agree t he 
some training is necessary even if it is in M 
form of weekly supervision meetings. Div 
sity of opinion abounds, however, on the quri 
tion of the exact program of training. Furthe? 
there is the question of the kinds of contro" 
to be established over the nonprofessionals 
organizational and legal controls as well ? 
supervisory controls. d 

The use of nonprofessionals in general ". 
college students in particular offer promise Y 
the effort to provide more complete ment 1 
health services to an ever increasing popula” 
tion. However, researchers must move quick 
with sound research designs to establish 2 
firm empirical basis for justification of tb 
nonprofessional as well as for student develop* 
ment in working with distressed indiviquals 
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MODEL OF LEADERSHIP EFFECTIVENESS: 


FRED E. FIEDLER ? 


A REVIEW OF EMPIRICAL FINDINGS! | 
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University of Washington 


This paper reviews studies which tested 
leadership effectiveness. The model predi 


LPC leaders (“relationship-oriented” 
intermediate in favorableness, The 
but not fully supported by dat: 
studies extending the mode! 
ness dimension moderates 
organizational performance. 
formance for the coacting t: 
within the same group cl 


the type of leadership situation. 


A contingency model of leadership effective- 
ness, described in a theoretical paper 7 years 
ago (Fiedler, 1964), has stimulated numerous 
studies in the area testing the model as well as 
attacking it in a recent issue of this journal 
(Graen, Alvares, Orris, & Martella, 1970). The 
present paper reviews 25 investigations pur- 
porting to test or extend the model. 

The contingency model postulates that the 
performance of interacting groups is con- 
tingent upon the interaction of leadership style 
and situational favorableness. It has been sus- 
pected for some time that group effectiveness 
depends on attributes of the leader as well as 
of the situation (e.g., Tannenbaum & Schmidt, 
1958; Terman, 1904). 'The question in leader- 
ship theory has been, What kind of leadership 
Style for what kind of situation? The con- 
tingency model specifies that the so-called 
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and extended the contingency model of 


cted a curvilinear relationship such that 
leaders with low least preferred co-worker (LPC) scores 


perform more effectively in very favorable 


(“task-oriented”) would 
and unfavorable situations, while high 


) would perform more effectively in situations 
model is supported by data from field studies 
a from laboratory studies. Field and laboratory 
| supported the prediction that the situational favorable- 
the relationship between leadership style and group or 


ask groups were similar to those for interacting groups 
assification of the model, while training groups appear to 


Relations between leadership style and group per- | 
require a predominantly relationship-oriented le j 


adership style, irrespective of 


i 
“task-oriented” leaders perform more a 
tively in very favorable and very eye 3 
situations, while "relationship-oriented P. 
ers perform more effectively in situ 
intermediate in favorableness. The rs 
operationalizes leadership style as well as SI e: 
tional favorableness and, therefore, lends 1t 
to empirical testing. : ms OE 
This study first defines the main Lem on 
the theorv, and briefly reviews the finding (a) | 
which the model is based. It then presents "E 
validation evidence relevant to the m" ife 
prediction of group performance in rea ex 
studies and laboratory experiments; (0) e 
tensions of the model to a more broadly an 
hypothesis; and (c) an analysis of results D Í 
ing upon the reclassification and prediction 
performance of coacting groups. 


Definitions 


The main terms of the theory—leaderslif 
style, situational favorableness, and leadership ' , 
&roup effectiveness—are briefly described bel? " 
as are the definitions of inlera iin 
groups. 

Interacting groups. These 
the members work ¢ 
pendently on a com 
tions of individual ; 
cannot, therefore, re 
members, for this 


acting and code 


m 
are groups in he | 
operatively and intere 
mon task. The contri": 
members of these gro 
adily be isolated, and "a 
reason, are typically ™ 


d 


warde s 
B rron as a group. In contrast, in 
relative pario members perform their tasks in 
example dependence of one another, as for 
Riehl mere of bowling teams, men in 
in which oW uction or in training situations 
individual « » participant typically receives an 
training. score or evaluation at the end of 
g. 

in LAE style. The predictor measure used 

s of the contingency model is the 
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least preferred co-worker (LPC) score. This 
score is obtained by first asking an individual 
to think of all co-workers he has ever had. He 
is then asked to describe the one person with 
whom he has been least able to work well, that 
is, the person he least prefers as a co-worker. 
This need not be someone with whom he works 
at the time. The description is made on 8-point, 


bipolar adjective scales, for instance, 


pare : unfriendly 


uncooperat ive 


oe 
friendly : — ái 
8 7 6 5 + 3 2 1 
cooperative :——: $ £ 
8 7 6 5 4 3 2 1 


AS m o. ; 
$ a rule, 16 to 24 items have been used in 


Lee 
su, cales The LPC score is obtained by 
lo the abu item values, giving à value kt 
Son edis 
his least s E that the su bject he sehe 
Worable preferred co-worker in ied E red 
Value in oe tiat is, wt an eee dem 
Scale, A ] neighborhood o 5 or Mes pee 
erred c Ow score means that the eas p ie 
thee Worker is described in avery negativ i 
about ng manner, that is, an LPC oe 2 
bE mars (Fiedler, 1967a, p- 43). It should = 
his ee that the low LPC person descri = 
“ata preferred co-worker in à uniformly, 
üs Se oar tale tit or stereotyped mannet 
ion ] , bad.” The high LPC person $ descrip- 
pue a considerably greater item var — 
43 dard deviation of 1.43 for the high versus 
E fo the low LPC person). 
While hie has been difficult 
äsk-ori abels of relationship-or? € xcu 
ow 6 m have been given to high versus 
eading persons, the terms are somewhat mis- 
vorab; First, only in situations which are ue 
M dia (that is, stressful, anxiety pee 
cader the leader little control) do we r 
terms S nein which correspond to Los 
has f "iedler, 1967a). Second, Mitchell (197 : 
oh ound evidence that high LPC leaders ten 
ing € cognitively more complex in their think- 
B about groups: while low LPC leaders tend 
is Elve more stereotyped cognitively simple 
Sponses. Similar results: i.e., a-correlation of 
35) have been reported by Schroder and his 


to interpret. 
ented versus 


co-workers (H. Schroder, personal communica- 
tion, 1969). 

Thus, the LPC score must be seen as a mea- 
sure which at least in part reflects the cognitive 
xity of the individual and which in part 
motivational system that evokes 
relationship-oriented and task-oriented be- 
haviors from high versus low LPC persons in 
situations which are unfavorable for them as 
leaders. 

Situational favorableness. The variable that 
erates the relationship between LPC and 
ce is the situational favorable- 
It is conceptually defined as 
hich the situation itself pro- 
vides the leader with potential power and in- 
fluence over the group's behavior. Situational 
favorableness appears to be quite important 
in affecting a wide range of group phenomena, 
as well as interpersonal behaviors. It seems 
likely that this dimension may have far-reach- 
ing significance in other personality research, as 
well as in socia ological investigations. 

leness has been opera- 


comple 
reflects the 


mod 
group performan 
ness dimension. 
the degree to w 


] psych 


Situational favorab , 
tionalized in a number of ways which are dis- 
ater. The original work on the con- 


cussed | 


tingency model presented one method based 


on three component dimensions that affect the 
degree to which the situation provides the 
leader with potential power and influence. 
These are leader-member relations, task struc- 
ture, and position power. The hypothesis was 
that (a) it is “easier” to be a leader of a group 
that respects and accepts its leader, or in which 
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= LEADERSHIP POSITION POWER -+Y 


Fic. 1. A model for the classification of group task situations. (Reproduced with permission from The 
Harvard Business Review, September-October, 1965, p. 117). 


the leader feels accepted, than in a group that 
distrusts and rejects its leader. Likewise (b), 
it is considered easier to be a leader of a group 
that has a highly structured, clearly outlined 
task than of a group that has a vague, un- 
Structured, nebulous task; (c) it is easier to be 
a leader when the position is vested with power 
(when the leader has the power to hire and fire, 
promote and transfer, give raises or lower 
salaries) than it is to be a leader who enjoys 
little or no power over his members: It is easier 
to be a general manager than the chairman of 
a volunteer group. 

Leader-member relations were considered to 
be the most important of these situational fac- 
tors, and subsequent studies have supported 
this supposition (Fishbein, Landy, & Hatch, 
1969; Mitchell, 1969). Detailed instructions for 


ne een. 
task structure, and position power have b 


described. Leader-member relations al. 
er 


measured by means of sociometric pref! ich 
wi 


ratings or by a group atmosphere scale Ack$ 
is similar in form and content to LPC, but Aes 
the subject to rate his group as a whole. d 
for rating task structure and position power ro, 
described in Fiedler (1967a, pp. 24, 28, ^ 
281-291). NE 
We could then classify group situations A 

DI 


TA ions) | 
obtaining measures of leader-member relatio ! 


means of the three dimensions. For this Pe 
pose, all groups were classified as falling ae 
or below the median on each dimension- oft 
led to an eight-celled classification br d 
which can be depicted as an cight-celled © 2 
(Figure 1). Each of the eight cells or "octo id 
can be scaled in terms of how much powe” * 


CONTINGENCY MODEL OF LEADERSHIP 


1.00 


Correlations — Leader LPC/ASo 
and Group Performance 


Fav. T JE 


Leader 


Ldr. Memb. 
Relations 


Good Good 


Task 
4 Str. 
Structure xs 


Leader 
Position 
Power 


Strong Wook 


between leadership, LPC sc 


Vic. 2, Correlations 
ness dimensio: 


of the situational favorabler 


Influence a leader might have in such a situa- 
tion. Obviously, a liked and accepted leader 
Who has a clear-cut task as well as power over 
the fate of his members (Octant I) will have 
à very favorable situation. Conversely, à dis- 
trusted chairman of a volunteer group with 
vague problem-solving or policy-making task 
(Octant VIII) will be in a very unfavorable 
Sltuation to exert power. Other octants fall 
etween these two extremes. 
Leadership effectiveness. Th i 
€ leader is here defined in terms of the major 
eisignment of the group; that is, the leader's 
€ctiveness is measured on the basis of the 
Kroup’s performance of its major assigned task. 
mile such other aspecte of Eor persona 
Browth i | > piper -oncomitants of 
grou might be important conco 3 
Side effectiveness, they are here pon 
Cont ed to be the primarv criterion, but at 
We ributors to performance. In other wows, 
Co evaluate the performance of an orchestra 
ductor not by his ability as a musicologist. 


e performance of 


Good 


Unstr. 


Strong 


and group effe 
n for studies of interac 
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Iv eve xr xr vm 
OCTANTS 
Mod. Mod. Mod. Mod. 
Good Poor Poor Poor Poor 
Unstr. Str Str. Unstr. Unstr. 
Weok Strong Weak Strong Wook 


ctiveness plotted for each cell or octant 
ting groups conducted prior to 1963, 


f his musicians, but by how 
well his orchestra plays. Whether happy 
musicians play better than unhappy musicians, 
or whether the man who is a great musicologist 
is a better conductor is an important research 
question in its own right. The major question 
asked here is the relationship of leadership 
style (specifically LPC) and group or organiza- 


tional effectiveness. 


or the happiness o 


Previous Results 
ng groups from 15 studies, antedat- 
were classified according to their 
favorableness, and the correlation 
der LPC and performance was then 
computed for each set of groups. The correla- 
tions between the leader’s LPC score and the 
group’s effectiveness measures, when plotted 

al favorableness, generated 


situation 

ped distribution indicating that the 
performed more effectively 
lers in very favorable and 
gh LPC leaders 


Interacti 
ing 1963, 
situational 
between lea 


against 
a bow-sha 
low LPC leaders 
than high LPC lead 


very unfavorable situations; hi 
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performed more effectively in situations inter- 
mediate in favorableness (see Figure 2). 

We here review the validation evidence that 
has accumulated since publication of the model 
in 1964. Before specifically reporting any of the 
studies, it should be stressed that the group 
classification system was viewed “as a very 
convenient starting point for presenting the 
empirical results which we have obtained in 
our research on interacting groups [Fiedler, 
1967a, p. 34].” Improved methods for mea- 
suring situational favorableness were expected 
to be developed in time. The three component 
dimensions did, however, turn out to be a very 
convenient method for testing the model since 
operationalized measures were available. 


VALIDATION EVIDENCE OF THE 
CONTINGENCY MODEL 


Classification of Studies 


A number of studies designed to test the 
model have been conducted by various inde- 
pendent investigators as well as by the writer 
and his associates. Some of these investigators, 
by design, and others, by oversight, have not 
followed the methodology originally described. 
Different operationalizations of situational 
favorableness were used in some studies, while 
others extended the model to coacting groups 
(e.g., Hill, 1969; Hunt, 1967), and some used 
leadership style measures unrelated to LPC 
(Shaw & Blum, 1966). These differences in 
methodology and divergencies from the model 
are, of course, quite appropriate and desirable. 
However, studies that do not conform to the 
explicit methodology of the earlier work can- 
not be used as exact tests of the model. 

This divergence in method presents difficul- 
ties only where the investigator and this re- 
viewer disagree on the appropriateness of a 
study for testing the contingency model, or 
where the methodology is inadequate to test 
the model. This problem has here been handled 
as follows: 


1. Four independent judges carefully read 
the definition of interacting and coacting 
groups, and the definitions of the various eos 
dimensions of the situational favorableness di- 
mension presented by Fiedler (19672). a 

2. The judges were then given all iie 
that purported to test the contingency model. 
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They were asked to read the entire methodol- 
ogy section in the case of shorter investigations, 
and relevant sections of very extensive studie — 
They were not asked to read the results. Using 
the scales described by Fiedler (19672), d 
judges classified each study in terms of jc 
group situation into which it should 
classified. idi i 

3. Groups were included among the vali d 
tion studies of interacting groups if three ih E 
four judges could agree that the groups bw 
interacting, and if at least three of the jide 
agreed into which octant the groups should 
classified. 


Because of the nature of the pre-1963 1° 
search on which the original analysis was pe. 
all groups in Octant T, II, and V were n 
field studies of natural groups and organ! i 
tions, and all but one set of groups in MAS 
III, IV, VII, and VIII were from labora ery 
experiments using ad hoc groups. It is viy 
difficult to reproduce certain effects commo el 
found in natural groups under experimes. y 
conditions. These include, for example, a 
leader position power, high stress, and ps. 
poor leader-member relations. For this e 
the model can be more meaningfully eva 
as a predictor if natural groups and ad n 
groups in laboratory experiments are © 
sidered separately, as well as together. 


Field Studies 


j 
t 
The Hunt studies. The first field study t° e 
the contingency model was conducted by f 15 
(1967) in three different organizations, name 
a large physical science research laborato 
a chain of supermarkets, and a heavy E 
chinery plant. In each case Hunt gota | 
Sroup atmosphere scores from managers En. 
foremen, and ratings of position power and ta 
Structure from management personnel E 
higher levels in the hierarchy, Higher manag' 
ment of the rese 
manufacturing pl 
ance ratings, whil 
an index of sales per employee manhour W” 
derived for managers of me 
The position power of all 
men was La a B be high, 
rated to be high fo 
and for meat Ea developmental researe™ l 


arch laboratory and in , 
ant also provided perfor” 
€ objective criteria base 


at markets. 


Managers of basic 


research. groups and for 
E^ foremen of the heavy machinery plant. 
€ correlations obtained for each of these 
Pero by octant, are presented in Table 1. 
sented jum on coacting groups are pre- 
à later section.) 
E one firm. A second set of real-life 
E investigated by Hill (1969). The 
Dm. A Tin conducted in a large electronics 
enginee "uh study dealt with supervisors of 
Sembly ne teams and with instructors of as- 
rated | Broun: Assembly-line instructors were 
lask by Hill's judges as having structured 
Sand low position power, while supervisors 
Struct g as having un- 
Netured tasks and high position power. 
a neber relations were measured by 
omi? visors group atmosphere scores, trichot- 
mized, with the upper third considered to 
‘ie ol leader-member relations, and the 
m t third considered to have poor leader- 
ember relations. 
teat correlation for nine assembly instruc- 
(Oc teams with high group atmosphere 
een II) was — .10, and for the nine teams 
` * low group atmosphere (Octant IV), — 24. 
Su i Correlation between eight engiite- us 
mcs Vis LPC and performance ter high 
or Up atmosphere (Octant III) was — -29, an 
Sph eight supervisors with low group atmo- 
ni groups (Octant VID), it was — bs 
ua i Health Teams: T. REP : a 
healt} gen (1969) conducted a study pi iom 
Er. h volunteers in Honduras during t T n 
m of 1966. The sample consisted of 22 
Magers who were assigned to teams ID 
o duras to operate public health clinics and 
hoi orm community development es in 
Were ne towns and villages. Formal iu 
we S not assigned. The teams" informal lea BE 
€re identified at the end of the volunteer 
Pd on the basis of sociometric T saepe 
ru Ponses. The task of these groups, namely, to 
N a public health clinic and, time permitting, 
im cama _some community development 
ins was fairly well-specified by the sponsor- 
'E organization. Problems arose when the 
illagers and the town officials failed to co- 
Operate and when the population was unsup- 
Portive or hostile. Under these conditions the 
Volunteers experienced Neq s stress, 
Often verbalized 25 feeling at a loss about 


Of eng} A 
ngincering groups were rated 
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TABLE 1 
TEST OF THE CONTINGENCY MODEL 
ix Hunt’s STUDIES 
| 
Octants 
Sample 
I IIL y VII 
ch chem- 
basic 
arch .60 (6) 430 (5) 
Research chem- 
: devel- 
opment —.67 ( 7) 
Meat markets —.51 (10) 21 (11) 
General foremen: 
heavy manu- 
—.80 (5) —.30 (5) 


facturing 


Note.—Numbers in parentheses indicate number of cases. 


what to do." Under these stressful conditions, 
the team members were more or less on their 
own in trying to cope with the problems they 
encountered. All judges who evaluated the 
study agreed that the position power of the 
leader was low, and three of four judges con- 
sidered the task to be relatively structured in 
the stress-free condition, but unstructured in 
the situation in which village support was 
absent. 

The informal leader’s group atmosphere 
scores were used to measure the leader- 
member relations. Groups operating in co- 
or favorable villages could then be 
s falling into Octants II or VI, de- 
e leader’s group atmosphere 
n uncooperative, unfavorable 
ssified as falling into 
gain depending on the 


operative 
classified a: 
pending on th 
score; groups i 
villages could be cla: 
Octants IV or VIII, a 
leader's group atmosphere scores. 
Public Health Teams: IT. A second study, 
practically identical in procedure, was con- 
ducted in the same organization (but with 
different volunteers) 2 years later by O’Brien, 
Fiedler, and Hewett.? One major difference in 
training was that most volunteers had received 
a culture training program developed for use 
of the project- This program made the situa- 
tion somewhat more favorable since it provided 
some information that enabled the volunteers 
to understand and to communicate more effec- 
tively with their host nationals. A study of the 
30'Brien, G- E., Fiedler, F. E,& Hewett, T. The 
effects of programmed culture training upon the per- 
formance ©! volunteer medical teams in Central 
Urbana, Illinois: Group Effectiveness Re- 


America. i i i 
3 m Laboratory, University of Illinois, 1969. 


134 


TABLE 2 


CORRELATIONS BETWEEN LEADER LPC Scores AND RATES: TEAM PERFORMANCE IN Two STUDIES 
(1966, 1968) or PUBLIC HEALTH VOLUNTEER TEAMS 


FRED E. 


FIEDLER 


Situation | 


1966 | 1968 | 
| Octant | E Median 
Group Tark Position Correlation | x | Correlation | x | 
atmosphere structure power | | | 
| | = i 
High | High Low I | -n | 13 | -4 | 7 2 
High Low Low IV | 00 | 5 | AT 9 E 
Low High Low VI .67* | 9 — 45 8 2 
Low | Low Low VIII | —51 | 12 | —.14 7 -. 
| : EA E s 
*p «.05. 
i 1 — á es- 
effects of this program on adjustment and per- groups must have been coacting (c.g, M 


formance, conducted in 1967, provided evi- 
dence that individuals who had received 
"culture assimilator" training had adjusted 
and performed more effectively abroad than 
those who had not. 

The correlations between leader LPC and 
performance ratings of the headquarters staffs 
are presented in Table 2 for both studies. The 
median correlations for these groups support 
the model (correlations for Octant VI were not 
predicted). 

The patterns of the 1966 and 1968 correla- 
tions both form a bow-shaped pattern, but the 
positive correlation was found in Octant VI in 
the 1966 group, but in Octant IV in the 1968 
group. 


Unclassifiable Field Studies 


A number of other studies designed to test 
the contingency model cannot be included in 
the present analysis because the judges rating 
the published research could not agree on the 
classification of the groups. Thus, a major 
study by Butterfield (1968) attempted to com- 
pare five theories of leadership, namely, four 
theories developed at the University of 
Michigan and the contingency model. The 
study was conducted in an administrative unit 
of a federal agency. The sample of groups was 
not clearly described. References to groups in- 
dicated, however, that “the various work units 
were engaged in rather different functions rang- 
ing from delivering mail to performing financial 
audits . . . [p. 61],” and measures of effec- 
tiveness such as "typed papers, delivered mail, 
dispatched automobiles, coding systems . . . 
[Appendix B].” These descriptions led all 
four judges to the conclusion that a number of 


senger services, typing pools, motor pool dis- 
patchers, etc.). Equally important, however 
Butterfield’s measure of leader-member rela- 
tions consisted of only two questionnaire items» 
one of which dealt with how much annoyance 
an employee feels with the manager. The A. 
portant leader-member relations dimensio 
was, therefore, not adequately represented n 
Butterfield's study. 3 

A field study by Kretzschmar and Lueck 
(1969) dealt with 67 managers in business aci 
ministrations of four industrial companies !^ 
Germany. In contrast to other studies; 4 
measures of task structure, leader-membe! 
relations, and position power were based i. 
supervisors’ own ratings. In addition, super 
visors also rated their own effectiveness. o 
studies have shown that leaders’ evaluation” 
of their own performance typically do not co E 
relate with objective measures, and thes 
methodological differences made it impossib 
to compare this study with others in the group’ 

The field studies which can be used to t€? 
the contingency model are, then, those za 
Hunt (1967), by Hill (1969), by Fiedler et p 
(1969), and by O’Brien et al. (see Footnote 7. 
These results are now discussed, along wit 
those obtained in laboratory studies. 


P 


Laboratory and Field Experiments 


The contingency model was based on - 
studies in Octants I, II, and V, and on TÉ 
trolled experiments in Octants III, IV, d 
and VIII. Since that time a number of em 
and laboratory experiments have attempt? gi 
test the predictions of the model in all octa” gt 

The Belgian Navy Study, This was à 


late, 
: e it 
field experiment conducted in cooperation w 


l 
: 
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TABLE 3 


CORRELATIONS BETV 


EEN LEADER LPC SCORES AND GROUP PERFORMANCE SCORES OBTAINED 


IN STRUCTURED AND UNSTRUCTURED TASKS OF THE BELGIAN Navy STUDY 


Homogeneous groups 


Heterocultural groups 


Octant GA TS PP Correlations* Octant GA TS PP Correlati 
] Mon 
; : t + + —.72 IX + F E .03 77 
BITITIS 37 x | +/+] - 77 | -.53 
n a x. ds XI a he 4 20 | —26 
ESSET ‘08 xm | & | — | — |=ax | x 
y e Fi J 16 XII = + m 08 | —.19 
we l= |e | = ‘07 Sy | -— | ek i = 53 | —90 
wn c es deck 26 my e — = | aay Be 
= p: L | 3 XVI = = = EU l a 
Note.—GA = group atmosphere, TS = task s = positi P 
Note.—GA, = group atmosphere, T$ taak ring to uie order of POT ration i se 6); 


ne Belgian naval forces. The study involved 
96 three-man teams which were experimentally 
assembled so that 48 teams would be culturally 
homogeneous (all French or all Dutch speak- 
ing) and 48 teams were heterogeneous (the 
leader from one language sector and the mem- 
bers from the other). In 48 teams the leader 
was a petty officer who had high position 
power, while the other 48 teams were headed 
by a recruit who had low position power. The 
groups were given four different tasks, one of 
which was coacting (teaching men to assemble 
an automatic pistol). The coacting task and 
one of the two structured tasks could not be 
reliably scored, and only one structured and 
one unstructured task were, therefore, suitable 
for more intensive analysis. Leader-member 
relations were assessed on the basis of group 
Climate scores obtained from leaders after each 
of the task sessions. 

Since the contingency model, 
1964, was based on culturally 
Sroups, that is, groups in which 

ad the same mother tongue, OF 

€ used for validation purposes 


presented in 
homogeneous 
all members 
nly these will 
at this time. 


j Ple 3 presents the correlations between 
eader LPC scores and group performance 
h with 


Seas. 5 . Aci 
Cores. Two correlation coefficients, eac 


m The Graen et al. review (1970) included all three 
€racting tasks in the analysis of evidential results of 
che a s model. However, it was clearly pointed 
stry by the present writer (1967a, p. 161) that the m 
Ge Clured task was methodologically inadequate; 
top obtained perfect scores and 62 groups ma le 
fir. al of 189 errors by not following instructions. T ie 
n St task was, therefore, quite unsatisfactory and cou 
9t very well have yielded anything but random results. 


n — 6, were computed since the tasks were 
presented in counterbalanced order. As can be 
seen, the findings do not support the con- 
tingency model which postulates a curvilinear 
relationship. 

A post facto analysis of the data suggested 
that the laboratory manipulations had not 
been adequate for the purpose of creating 
a sufficiently unfavorable situation for the 
leader. The homogeneous groups probably did 
not develop really poor leader-member rela- 
tions, and there was also a question whether 
the supposedly structured task (requiring the 
group to find the shortest route for a ship which 
had to cover 12 ports) was sufficiently struc- 
tured. The task, basically a topological puzzle, 
requires problem solving, and subsequent 
studies have shown this type task to be inter- 
mediate in structure. A bow-shaped relation- 
ship similar to that of the contingency model 
did emerge when heterocultural groups were 
included in the analysis to intensify the situa- 
tional difficulty for the leader. Heterocultural 
are, of course, much more difficult to 

and leaders reported significantly 
higher anxiety and greater tension in heteroge- 
neous groups than in homogeneous groups. 
This effect was, however, not predicted. 

The Japanese student study. Shima (1968) 
tested the contingency model in Japan, using 
two Guilford? tests, namely, the Unusual Uses 


s Guilford, J- P» Berger, R. M., & Christiansen, PR: 
A factor analytic study of planning I: Hypothesis and 
description of tests. Los Angeles, California: Psycho- 
logical Laboratory, University of Southern California, 


1954. 


groups 
handle, 


w- F 
ay 
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Test considered to be moderately structured, 
and an "integration" task which required 
groups to invent a story using 10 unrelated 
words. All subjects were high school students, 
and these were assembled into 32 groups. The 
leaders were elected by the group members, 
and the group's relations with the leader were, 
therefore, assumed to be good. However, the 
judges rating this study disagreed with Shima’s 
classification of position power. Since the leader 
was elected and, therefore, had obtained his 
position from his fellow students, and since the 
groups were ad hoc, the judges rated position 
power in this study as being low. Based on the 
assumption that leader position power was low, 
the groups would be classified as falling into 
Octants II and IV. The corresponding correla- 
tions were — .26 (n = 16) and .71 (n = 16, 
p < .05), thus supporting the model. 

Church leadership groups. Mitchell (1969) 
conducted a small study of group performance 
as part of a leadership training workshop to 
determine the relationship between cognitive 
complexity and LPC. The participants were 
members of Unitarian churches who attended 
à leadership workshop. Each of the groups per- 
formed two of four tasks, one structured, and 
one unstructured. The tasks consisted of find- 
ing the shortest route for (a) a school bus and 
(b) a cross-country road race. The two unstruc- 
tured tasks were to write a position paper on 
the church's stand on (c) legalizing abortion 
and (d) a “Black Caucus" within the Uni- 
tarian-Universalist church. Leader position 
power was low. 

Mitchell (1969) originally computed corre- 
lations for groups with poor as well as with 
good leader-member relations, and these corre- 
lations were included in the Graen et al. (1970) 
critique. The Mitchell study was not, how- 
ever, designed as a test of the model, and an 
analysis of the data showed that only 2 of the 
64 group sessions in this study had group 
atmosphere scores below 55, a score which is 
roughly at the median of group atmosphere 
scores for similar studies. As pointed out be- 
fore, a finding of this type is neither unusual 
nor unexpected in ad hoc groups. Most re- 
searchers, testing the model, have, therefore, 
tried to increase the difference between gEGBDS 
with high and low group atmosphere scores zd 
using only the upper and lower thirds of the 
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distribution, whenever this was possible. In 
light of the high group atmosphere scores In 
the. Mitchell study, all groups were classified 
as falling into Octants II and IV depending 
upon task structure. The resulting rank-order 
correlations for Octant II were .24 and -17 
(n = 16), and for Octant IV, 43 and 38 
(n = 16). 
culive development workshop. An almost 
identical study was conducted by Fiedler 2$ 
part of an executive development program. 
Here again, all but one of the group atmosphere 
scores was above the usual cutting score, 4" 
all groups were classified as having high group 
atmosphere. The groups were given a relatively 
structured task and an unstructured task (i.e 
routing a truck convoy and writing a recruiting 
statement inviting college students to becom 
junior executives). All leaders were rated a 
having low position power, and thus all grouP 
fell into Octants II and IV. The correlations 
were .34 and .51 for the structured and a 
structured tasks, respectively (gs = 11). (Se 
also the section reviewing results on groups b 
training which might apply to the churc? 
leadership and the executive developmen 
workshop studies.) ee 
West Point Cadets. Skrzypek® recently. tom 
pleted a study of 32 four-man groups compose 
of cadets at West Point. Leaders were chose"? 
from among a pool of 400 men whose LE 
scores fell either one standard deviation aboV 
or below the mean. Members were assigned, a 
random. Unlike other laboratory studies, wines 
used postsession questionnaires, leader-membe: 
relations in this study were determined a prio” 
on the basis of previously obtained sociometri" 
ratings that identified well-accepted and not 
accepted leaders among the cadets. Positio” 
power was varied by informing the group th 
the leader in the high position power woulc s 
evaluating each of his group members at th 
conclusion of the tasks and that these evalua” 
tions would become part of the individual's 
leadership score (a very important aspect o 
West Point’s system). The leaders in the low 
Position power were instructed to act as chai! 
men, and the group was told that the member? 


* Skrzypek, G. J. The relationship of leadership style 
to task structure, position power, and leader-membe 
relations. West Point, New York: United state? 
Military Academy, 1969. 
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. 8 à group would be evaluated on their 
Performance. 
po group performed one structured task 
order. m unstructured task in counterbalanced 
p Ben i Ie Structured task consisted of drawing 
scale E bustadks anda military-post area to 
groups : a unstructured task required. the 
cate er lie design a program which would edu- 
World a isted men in overseas assignments on 
thr politics „and maintain their interest 
roughout their tour. 
E results were as follows for each of the 
E. e Octant I, — .43; Octant IL, 
Octant a HII, .10; Octant IV, 33; 
08: Oc Vy 28; Octant VI, 13; Octant vil, 
Y Stant VHI, — .33. Thus, while none of 
: ee reached the 05 level of sig- 
Were Nee, all but the correlation for Octant III 
In the predicted direction. 


— 


| Nonclassifiable Studies. 


c e nurses. A study by Reilly? was e 
One i because the judges could not agree W s 
ia or with the investigator on } ‘ 
-Brou on power of the leaders. Reilly studie 
p Successively ee of nurses who were P 
Dini. y structured and unstructure d - 
ad hi problems. Whether the leader ui 
bus d. position power was dada m 
ET iN € leader was a fellow sient DÀ 
consisten responsibilities were limited: ey 
he n of making certain arrangements br 
iuis cup as well as assigning p pL 
is d. aeh student would obtain. T nig 8 d 
e in ed in a later section, since it may 2 
eri d as a training study- 
Eu mental change of position 
Neale Nonclassifiable study was cond 
ma and Shiflett (S. Nealey, P i 
a unication, 1969) who attempte Nori 
fay, nentally to induce a change of situational 
Orableness from Octant III to Octant IV 
e h Ice versa. The groups were assembled on 
ep esis of LPC and intelligence scores. The 
ras timental manipulation, changing ay 
Prior did not succeed although the I 
elation the experimental manipulation y ië e 
ns between LPC and group performance. 


C 


power. A 
lucted by 
ersona 

ex- 


of different leadership styles 
experiment. Ames, Iowa: 
| State. University, 


"na 
on Reilly, A. J. The effects 
Eroup performance: A field 


ng 3 ^ à 
19gg ral Relations Center, Tow 
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Here again, the judges did not agree on 
whether the leader’s position was, in fact, 
strong, since in the appropriate experimental 
condition the leader’s position power was 
established basically by instructing the leader 
to role play a person with a powerful or weak 
position power. 

Graen, Orris, and Alvares (1971) described 
two laboratory experiments which were spe- 
cifically designed to test the contingency model. 
These studies also constituted the main basis 
for a recent article in this journal by Graen 
et al. (1970), which questioned the adequacy of 
the model. 

Graen et al. (1971) used 78 and 96 male 
college students randomly assembled into 52 
and 64 three-man groups, respectively. Each 
group worked on two tasks. The students were 
paid for their participation, and one person in 
each three-man group was chosen at random 
to serve as the leader during the first task. 
Another member of the same group served as 
leader during the second task. 

Each of the groups was given one structured 
and one unstructured task in counterbalanced 
order. Using an 8-point scale for assessing task 
structure (Fiedler, 1967a, p. 25ff) Graen et al. 
(1971) chose two structured and two unstruc- 
tured tasks. Position power was varied by 
giving the leaders of one set of groups 
superior formal status relative to the members . . + 
special information about the task, and... the 
highest decision-making authority and responsibility. 
In the weak leader condition, the leader role was one of 
discussion leader without special information and with 
decision-making authority and responsibility close to 


that of the members. Each group worked on both tasks 
under only one of the two power conditions. The task 


sequence was randomized within power conditions 
[Graen et al., 1971, p. 198]. 
ere scores were obtained 
after each task session, and the groups were 
divided into those whose group atmosphere 
fallen above and below the median 
set. Groups Were then assigned to 
te cell of the contingency model 
by dichotomizing the niee situational vari- 
ables of position power, task structure, and 
a here. The results of the two ex- 


group atmosphere: 5.75. 
periments are given in Table 4. 

Graen and his associates claim that their 
two studies followed as closely as possible the 


prescriptions of the contingency model and 


Group atmosph 


scores had. 
for the entire 
the appropria 
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TABLE 4 
CORRELATIONS BETWEEN LEADER LPC AND PERFORMANCE IN THE GRAEN ET AL. STUDIES E 
| Octant _ 4s 
Experiment ———— 
| 1 | I n Iv v | wi VII vill 
I iG — 
1 AT —.41 E 33 25 —.39 ES = p 
2 13 Br .02* —.08* = 52% — 43 AS ` 
me 


a Correlations in a direction counter to the hypothesis of the model. 


research methodology used in its development. 
Because their results were nonsignificant, the 
authors concluded that their studies, therefore, 
“cast doubt on the plausibility of the con- 
tingency model [Graen et al., 1971, p. 201].” 

Nonsignificant results obviously can occur 
for any number of reasons, including inade- 
quate experimental design. A study that 
attacks a theory, therefore, must not only be 
methodologically sound (especially if it does 
not support an alternative hypothesis), but it 
must also guard against the possibility of ob- 
taining nonsignificant and randomly dis- 
tributed data because of inappropriate or 
marginal experimental manipulations. 

A methodological critique in the Journal of 
Applied Psychology (Fiedler, 1971) pointed to 
various weaknesses in the manipulation of 
three independent variables in the Graen 
et al. (1971) experimental design. The manipu- 
lations were inadequate and make the studies 
inappropriate for inclusion in the present 
analysis. A summarv of the inadequacies is 
presented below. The discussion is based on 
data in the original Graen et al. (1971) paper 
as well as subsequent data included in a re- 
joinder to the methodological critique of their 
studies, 

1. Task structure. Task structure was as- 
sessed by means of a rating scale with scores 
ranging from a maximum of 8.0 to a minimum 
of 1.0 (Fiedler, 1967a). The average task 
structure scores of the studies on which the 
original contingency model paper was based 
averaged 7.39 for the structured and 3.15 for 
the unstructured tasks. In contrast, the scores 
in the Graen et al. (1971) studies were 5.85 and 
5.45 for the two structured tasks, and 3.69 and 
3.60 for the two unstructured tasks. The scores 
for the structured task were, therefore, less 
than 1 scale point above the cutting score of 


5.0, and the differences between structured 
and unstructured tasks were only 2.17 and 1. f 
scale points. This clearly represents a TH 
weak manipulation as compared with the diffe 
ences of 4.24 in the original tasks. 

While it may be argued that the task st É 
ture scores in some of the original studies M. 
similar to those in the Graen et al. studies; a 
experiment which seeks to disconfirm a theo H i 
should not use marginal manipulations in t6 
ing the null hypothesis. 

2. Position power. Graen et É 
manipulated this variable by giving the ! ; 
domly chosen leader "superior formal stard 
and “special information about the task,” ! E, 
delegating to him “the highest decision-making 
authority and responsibility.” This was ad 
here by talking directly to the leader in d 
presence of the members, presenting writ 


rut- 


al. (1970) 


gns 
" 


: z > mail 
task instructions to the leader only, mitt 
"in P Y ast 
taining body orientation and eye contact W, 


the leader," and giving an official timing de 
to the leader. In the weak position power co 
dition, the formal status of the leader was E 
reinforced, verbal instructions were address, 
to the group, written instructions were La e 
to no one in particular, and the timing deV 
was placed in the center of the table. yer 
A number of reasons suggest that this PS 1 
manipulation is inadequate. First of all, "atii 
difficult to believe that the particular exp“ i 
menter behaviors—like looking the eade? i 
the eve, and giving the leader of an a 7 
group a timing device and written inst ott 
tions—are sufficiently powerful manipulat! {0 
to do all the things the authors expect d 
accomplish. Position power is conceptual” g 
as providing the leader with some real poWe s. 
give rewards and sanctions. In other wor is 
the leader must have some fate control ove” i. 
members. It is very difficult to give high m 


i 


1 
9 
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Eee es mra 
is successfully, it was 
ge accomplished by using individuals who 
DER edet 
(Fiedler, 1966) com ubi ed po m 
recruit leaders; a un ly EE : E 3 n 
by Meuwese mud "Fiedler (Fie IM : TUAE 
a study of West Point ici d -— [em 
F ` a by Skrzy pek (see 
; Ootnote 6) used cadet ofiicers who held 
er WINE than their members. Moreover, 
“gen Gf al. (1971) demoted the leader 
appointed for the first task session to member 
Status in the second session and made another 
member of the same group the leader in the 
Second session. This procedure is likely to 
dilute the formal leadership power. j 
Graen et al. (1971) subsequently reported 
data on perceived leader influence in support 
of their claim that the position power manipu- 
lation actually had been effective. Cell means 
of perceived influence ratings are reported in 
Table 5. 
in SUM difference in means over octants 
D ne 2 was only 16. At test compar- 
à he perceived influence ratings 1n the strong 
and weak position power conditions was 1.746, 
Which is not significant. As can be seen, some of 
the weak position power octants had means 
ee were higher than some of the supposedly 
high position power octants. Specifically, two 
ae “weak” octants had mean scores of 5.38, 
: e two of the "strong" octants 1n the first 
experiment had mean scores of only 5.14. The 


Position power manipulation in Experiment 
hs appears, therefore, not to have been 
llective, 


E Distribution of LPC scores. Leaders were 
“Signed at random and LPC scores were ob- 
me after the three-man groups had a 
E c This procedure does not proper y 
ES that leader LPC scores within each 
ers will have similar means and a 
ig It is obvious that a meaningful test 
a axe HRBSOOY model cannot be oimn if, 
ead xample, all groups within one octant have 
in es with high LPC scores, while all groups 
i bier octant have leaders with very’ e 
Soy An appropriate test is an analysis ir 
Vari Nee to determine whether the means o 
lous octants are reasonably similar; if so, 

© P test will be nonsignificant. 
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TABLE 5 


CELL MEANS OF PERCEIVED INFLUENCE RATINGS 
IN GRAEN ET AL. STUDY 


Condition Octant X 


Experiment 1 


5.83 6.17 


Strong position power 
433 4.83 


Weak position power 
Maite 


Experiment 2 


Strong position power | 3.75 5.63 6.13 3.50 | 5.29 
Weak position power 5.38 5.00 5.38 4.75 | 5.13 
6 


Maitt 


Data provided by Graen et al. (1971) show 
that the means of LPC scores within the vari- 
ous octants ranged from 84.7 to 42.7 in the 
first experiment and from 77.3 to 52.9 in the 
second experiment. A one-way analysis of 
variance was performed for each study with 
the eight octants as cells in the design. The F 
ratio for the second study was not significant 
(1.17), indicating that the various octants did 
not differ in mean LPC. The F ratio for the 
first study was, however, 3.10, which is sig- 
nificant at the .01 level and indicates that the 
distribution of leader LPC scores differed 
markedly from octant to octant. In other 
words, some octants, (e.g., Octant IV) con- 
tained few, if any, low LPC leaders, while 
others (e.g., Octant V) contained few, if any, 
high LPC leaders. 

Since the position power manipulation in 
addition to the weak task structure manipula- 
tions in Experiment 2 and the LPC score dis- 
tributions in Experiment 1 were inadequate for 
testing the contingency model, neither experi- 
ment was included in the validation analysis. 


Summary of Results 

Table 6, which summarizes all correlations 
from acceptable studies, shows that the median 
tions for six of the seven octants are in 
icted direction (Octant VI was not 
). Of these, the joint probabilities of 
ations in Octants I, III, and IV are 
ow the .05 level. Also, 34 of the 
are in the predicted direction, 
the .01 level by binomial 


correla 
the pred 
predicted 
the correl 
significant bel 
45 correlations 
a finding significant at 
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TABLE 6 | 


SUMMARY OF FIELD AND LABORATORY STUDIES TESTING THE CONTINGENCY MODEL 


Octants 
Study : -— 
| ro a am | ae | g vi vil | vill 
i | i E |__| ee 
Field studies 
ind ] = _— 
Hunt (1967) | —.64 —.80 21 30 
| =I .60 —.30 
Hill (1969)* xU A ~.24 62 By 
Fiedler et al. (1969) EA] .00 ‘67* EC 
O'Brien et al. (1969) | — 46 AT —.45 "TG 
| —— 
Daun experiments "o — 
Belgian Navy = 72 37 EN .08 46 07 26 E 
m -50  —.54 Kt .03 Br] = ^ 
Shima (1968)^ —.26 A 
Mitchell (1969) 24 43 
B. | 17 38 
Fiedler exec. 34 1 33 
Skrzypek* -43 -—32 40 38 28 EG Us = 
Median 
: m 5 
All studies 17 —.22 38 22 0 .26 33 
Field studies —21 =29 23 21 —24 30 73 
Laboratory experiments -24 —.16 38 16 A3 .08 x 
"Median correlations of Fiedler's 43 
original studies (1964) —.52 —.58 —.33 AT 42 05 = 


number of correlations opposite to expected directio 
* Studies not conducted by the writer or his associa s. 
*p «.05. 


test. It should be noted that the number of 
correlations in the predicted direction would be 
significant at the .01 level by binomial test 
even if we included the two Graen et al. 
(1970, 1971) experiments. These results permit 
the conclusion that we are not dealing with 
random effects: Group performance appears to 
be contingent upon leadership style and situa- 
tional favorableness. 

On the other hand, 5 of the 10 correlations 
obtained in Octant II are in the positive rather 
than in the negative direction. These counter- 
expectational positive correlations, although 
found only in this octant, throw considerable 
doubt on the overall generality of the relation- 
ship predicted in the Fiedler 1964 paper. It is, 
therefore, essential that we examine the rela- 
tions in greater detail. 


Note.—Number of correlations in the expected direction (exclusive of Octant VI, for which no prediction had been made 
11; by binomial test — .01 


"TL 


It will be recalled that in the 1964 study 
data for Octants I, IT, and V came from fier 
studies, while the data for Octants IH, I n 
VII, and VIII came, with but one exceptio" 
from laboratory studies. Field and labor 
tory results should, therefore, be exam” 
separately. fot 

Field studies. The median correlations 
field studies are quite similar to those predict’ 
in the Fiedler 1964 paper. All of the media" 
are in the predicted direction, and 13 of the 
predicted correlations are in the expected : 
rection, which is significant at the .05 lev" 
The curve, based on relatively few studies: ; 
not as regular as that obtained in 196^ 
Octant VI was not predicted. The predicte 
median correlation of .05 in Octant VII whic? 
we based on 12 correlations in the 1964 study” 


Nem with a median correlation of .30, 
Piedad i ree correlation coefficients. Overall, 
the Mes the small number of studies and 
Studies eg of cases within each of these 
a NA the results seem rather remarkably 
the En with the 1964 data, suggesting that 
ship pe 3 is valid for the prediction of leader- 
rn formance under field conditions. 
gan road and laboratory studies. Only the 
al, (1971 avy, the West Point, and Graen et 
dicted f ) studies provided data for all pre- 
Bow se It is, therefore, hazardous to 
octants than tentative conclusions for any 
aa aec Tee II and IV for which sufficient data 
en al able. Data for Octants Tand IV tended 
eagle the prediction of the contingency 
Bitan v. they did so only directionally in 
indicate II, V, VII, and VIII. The data clearly 
predi € that the model does not adequately 
Predict leadership performance in Octant II 
aboratory studies. On the other hand, 22 
E 29 predicted correlation. coefficients 
ES the expected direction, which is signifi- 
at the .01 level for binomial tests. 
ae Sm of possible explanations for the 
most dicted results suggest themselves. The 
E appe and obvious of these might 
eadershe that it is difficult to manipulate 
and s 1p variables in laboratory experiments, 
tions E important aspects of real-life situa- 
built do not permit themselves to be readily 
Into the laboratory. 


"of 


GENCY MODEL 


ed the more 
vor- 


Extr 
XTENSIONS or THE CONTIN 


oe of studies have test i 
Bblene ypothesis that the situational fas 
Fader, SR affects the relationship between 

Ship style and performance. Some of 
Ei 4 Studies tested groups in situations rang- 
a TO the very favorable to the very un- 
‘Vorable; other studies considered groups 
uling on only two points on the situational 
BoorMlilepen continuum. The various gon 
Iter 1an were categorized as being favorab e, 
or os or unfavorable situations by ie 
ED judges. However, since the degree © 
vesti en favorableness in some of these in- 

gations was not operationally specified in 
advance, the rejection of the null hypotheses 
becomes correspondingly more hazardous in 


these cases. This is especially so in studies in 
Which the statistical relations do not reach the 
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commonly accepted level of significance. On the 
other hand, it is difficult to obtain a large sam- 
ple of groups or organizations in any one study; 
and it is essential, therefore, that we consider 
the cumulative evidence from different 
investigations. 

Experimental variation in leadership behavior. 
Shaw and Blum (1966) instructed nine leaders 
to act in a highly controlling and directive 
manner, while a second set of 9 leaders was told 
to be permissive and passive. The groups were 
given three tasks in counterbalanced order, 
and the tasks varied in degree of structure. 
Permissive, passive leaders performed more 
effectively in the two relatively unstructured 
tasks (roughly Octant IV), the directive 
leaders, as predicted by the investigators, per- 
formed more effectively in the highly structured 
tasks (roughly Octant II). These results con- 
form to the general expectations of the model. 

Objectification of situational favorableness by 
structural role theory. O’Brien (1969) measured 
situational favorableness objectively by apply- 
ing methods of structural role theory (Oeser 
& Harary, 1962, 1964). O’Brien assumed that 
a situation would be more favorable to the 
leader the greater the number of paths he had 
to the task structure. The more readily and 
directly the leader could influence task per- 
formance, the greater his influence and con- 
trol of task-relevant group behavior. 

The structural role theory deals with rela- 
ng three elements, namely, persons, 
positions, and tasks. Relations among the first 
indicate the interpersonal relationships, rela- 
tions among the positions indicate authority 
relations, and relations among the tasks indi- 
cate allocations and task sequences. O’Brien 
assembled groups on the basis of personal com- 


patibility depending upon the members' simi- 
dissimilarity in scores on Schutz’ 


larity or n 
FIRO scale (1958). The group task consisted 
of constructing models from spheres and sticks 
according to à given pattern. Position and task 
allocation. were manipulated by determining 
the means by which the leader could interact 
in the task performance. A coefficient of situa- 
tional favorableness could then be computed 
hich, it Ceo, form, expressed. the 
leader’s paths to the task structure as a ratio 
of all possible paths. The higher the ratio, the 
more favorable the leader’s position power. 


tions amo 
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The results of O’Brien’s study, based on the 
correlation between leader LPC and the num- 
ber of models produced, showed the predicted 
relationships. For 16 groups with high situa- 
tional favorableness the correlation was — .08; 
for 16 groups with intermediate favorableness 
it was .77 (significant at .01); and for 32 groups 
with low situational favorableness the correla- 
tion was — .13. 

Helerocullural American and Indian groups. 
Anderson (1966) conducted a laboratory ex- 
periment which used graduate students from 
India as well as from the United States. These 
groups consisted of one American leader, one 
American group member, and one Indian group 
member. The tasks required the groups to 
negotiate an agreement on hiring practices be- 
tween an Indian village and an American com- 
pany, and to compose two different stories 
based on the same TAT card. Half the leaders 
had high LPC, and half had low LPC scores. 
Of these, half were instructed to be as con- 
Siderate as possible in their leadership be- 
havior, while the other half were told to struc- 
ture the situation as firmly as possible. 

Anderson performed a post hoc analysis that 
scaled the situations on the basis of favorable- 
ness. This scaling suggested that the TAT 
tasks were more favorable than the hiring 
problem (which required negotiation) and that 
the considerate condition would be more favor- 
able than the structuring condition. This order- 
ing of situations was supported by leader re- 
sponses to questions about their anxiety and 
tension in each of these conditions. 

The correlations for the four tasks in order 
of favorableness (TAT-considerate, TAT- 
Structuring, negotiation-considerate, negotia- 
tion-structuring) were — .50, .21, — .22, and 
— .12, thus suggesting a curvilinear relation- 
ship between LPC-performance correlations 
and situational favorableness (us = 8). 

Chemical processing companies. Lawrence 
and Lorsch (1967) compared the performance 
of six chemical processing companies, each of 
which had four subsystems: production, sales, 
applied research, and basic research. Each sub- 
system was under the direction of a senior 
management official, either at the vice- 
presidential level or immediately below. re 
structure of these subsystem tasks was ratec 
by a number of judges. Fundamental research 
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was rated as lowest in structure, while produc- 
tion was rated as highest. This reviewer corre 
lated the manager's LPC and rated perform- 
ance from the Lawrence and Lorsch data. T 
results follow the expectation that a low TEA 
manager would perform better on structure 
tasks, while a high LPC manager would P 
form better in unstructured situations. Th 
correlations, in order of rated structure, b 
for production, — .50; sales, — .313 applie 
research, — .10; and fundamental researc» 
66. (ns = 6). , 
Since these results were obtained witl y 
reference to leader-member relations, ee 
suggest that the leader-member relations i 
mension might be relatively less important i 
higher levels of the organization. This sec 
reasonable since the manager at the third we. 
higher levels usually has very few direct E 
tacts with production workers at the i 
supervisory levels, and relatively few conta 
with first-line supervisors. and 


Psychiatric nursing organization. Nealey uri 


hout 


Blood (1968) investigated the psychi M. 
nursing service of a large Veterans Administ" 
tion hospital. LPC scores were obtained fr 1 
supervisors at the first and second levels n Y 
organization, namely, head nurses in ue. 
of a ward, and “unit supervisors" in de at 
one of the hospital’s six large units, whe? 
both quite comparable as to structure nit’ 
function. Performance of wards and of vihe 
was judged by management personnel at Es 
level above the head nurse and the unit SUP 
visor, respectively. the 
The most important difference between mit 
job of the head nurse and that of the "ie 


3 
supervisor for purposes of the present an 
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ate 
appears to be the structure of her subordin® a 
tasks. The work of the psychiatric aide 1$ rafe 
tively structured since there are fairly spe of 


Dec " t 
guidelines available on the management pe 


the work of the head nurs 
quires considerably more 
making, and it is corres 
tured. Nealey and Blood 
scores of these nursing 
performance. In a ver 


and Owen (1970) about 18 months later | 
computed these corre] 


es she supervise, 
policy and dec e 
pondingly less SU SU 
l correlated the F d 
Supervisors with T? V 
ry similar study, Ne47 


: 22) 
ations on 25 head nur? 


hey ^ 
l 
t 
5 
d 
i 
15 
1 
c 
f the 
na 
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psychiatric patients and ward personnel. n 
job of the unit supervisor, or more precis 
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TABLE 7 


SUMMARY OF 3 5 
UMMARY oF LPC-GnouP PERFORMANCE CORRELATIONS OF STUDIES EXTENDING 


THE CONTINGENCY HYPOTHESIS 


Study and date Favorable Intermediate Unfavorable 
O'Brien (1 
; 969) —.08 " 
| Denon (1966) —.50 21 " 2a xe 
ag & Lorsch (1967)* —50 -—10 -—:1$ .66 Á 2) 
Ne & Blood (1968) —.22 79" 
m aley & Owen (1970) —.50* 
Ut & Barron? 
Sem 4 —42 = —32 67 —0& =.01 as 2B 
| =A zhb- 09 AL 45 —.20 47 —61 —.14 
Median = 27 20 —.30 
es degree of judged favorableness of the leadership 


sre location of the correlation coe 
"Study ABE HD es the left, the more 
Si conducted by writer or his 

In Fiedler (19672). əy writer or his 


ssociates. 


first study. 
LPC and 
1 studies 
= 25, 
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cient in the table indicat 
ble the situation. 


quite strong. The investigations do not permit 
an exact test of a theory since the methodology, 
the criteria, and the subject populations vary 
from study to study. At the same time, these 
studies provide consistent cumulative evidence 
that the correlation between leadership style 
and group performance is moderated by 
the situational favorableness dimension even 
though this dimension is operationalized in 
a wide variety of ways. Thus, the Shaw and 
Blum experiment, the Lawrence and Lorsch 
investigation, as well as the Nealey and Blood 
and the Nealey and Owen studies show that the 
structure of the task is important in determin- 
ing the direction of the correlation between 
LPC and group performance. The Fiedler and 
Barron (Fiedler, 1967a) study used stress as 
an index of situational favorableness. The 
O’Brien experiment presented an ingenious new 
metric of situational favorableness based on 
structural role theory, and the results of his 
study yielded relations in the expected direc- 
tion with one of the correlations highly sig- 
nificant. Finally, the Anderson (1966) experi- 
ment, based on post hoc analysis of the data, 
also yielded data that tended to show an inter- 
action of task and leadership style. Thus, not- 
withstanding the diversity of the studies, there 
is clear evidence that identified situational 
components determine, in part, the type of 
leadership style that a particular group requires 
for effective performance, 
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EXTENSION OF THE MODEL TO 
Coactinc Groups 


A number of field studies have attempted to 
investigate the relation of leader LPC scores to 
the performance of coacting groups and organi- 
zations (Fiedler, 1967b). The number of studies 
reported is now sufficient so that a review of 
the findings appears appropriate. 


Task Groups and Organizations 


Craft shops and grocery markets. Two organi- 
zations studied by Hunt (1967) were coacting. 
These were craft shops in the physical science 
laboratory and grocery departments of chain 
stores. Both of these types of organizations 
performed coacting tasks that were highly 
structured. In both organizations the leader's 
position power was high. Performance of craft 
Shops was assessed by ratings, while grocery 
departments were evaluated on the basis of an 
objective measure of number of man-hours 
over total sales volume. 

The correlations. for six workshops with 
high, and five with low group atmosphere 
Scores (comparable to Octants I and V), were, 
respectively, —.48 and .90 (p < .05), those 
for 13 supermarkets with high group atmo- 
sphere, —.06, and those with low group atmo- 
sphere, .49. 

Hospital departments. Hill's (1969) study in- 
cluded various departments in a hospital, in- 
cluding the nursing service, the controller's 
department, dietetics, housekeeping, Stores, 
central supply, and maintenance. All were rated 
to be coacting, and all supervisory personnel 
were rated as having high position power. 'The 
nursing service was rated by Hill as having an 
unstructured task, while other departments 
were rated as having structured tasks. Classify- 
ing the groups on the basis of the contingency 
model classification would make the nursing 
Service fall into Octants III and VII and other 
departments into Octants I and V, depending 
on the leader's group atmosphere score. The 
correlations for the Octant I and V groups 
were —.21 (n = 8) and .52 (n = 8) and for the 
Octant III and VII groups (nursing) —.32 
(n = 7) and .87 (n = 7; p < .05). 

Telephone offices. Bates (1967) conducted 
a study of telephone supervisors in two offices 
of the Bell "Telephone Company. ‘The total 
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sample consisted of 112 operators working In 
13 groups. Of these employees, 21 were first- 
line supervisors who provided LPC and group | 
atmosphere scores. The task was rated as 
highly structured, and the position power as 
high. These groups would then fall into 
Octants I and V. 

Bates obtained two criterion scores, 
"quality index" reflecting the accuracy "b 
courtesy with which the operators handle ca 
as well as the accuracv in billing calls. A Se 
ond index, called the “load factor,” reflects the 
average number of calls handled by an offica 
Three telephone executives who were inc $ 
pendently interviewed agreed that the qualia 
index is the more important and that offic 
rarely performed below the minimum be 
factor. Since the quality and load indexes ko 
negatively correlated (—.43), only the qualit) | 


a 


index was here considered. Bates divided n. 
groups into six with relatively high E 
atmosphere and seven with low group id 
sphere scores. The corresponding correlatie ° 
between supervisors’ LPC scores and quali 
indexes were —.77 and .75. tis 

School principals. McNamara (1967) inve 
gated the effectiveness of elementary udi, 
principals in 32 elementary schools of ! " 
Edmonton, Alberta, school system. The schol 
were relatively small in size, some contain f 
as few as six teachers. Although MeNamai 
spoke of his schools as interacting, all heit 
judges classified the schools as coacting. T er 
reason was that there is relatively little iB 
action among teachers in the performance at 
their instructional duties, nor is it likely "un 
the principal's job demands that he share m 
administrative duties with his staff or teacher 
The principal’s position power was rated ? | 
being high. bY 

The effectiveness of schools was judged We 
five members of the School system's adminis? 
tive staff. Leader-member relations were o 
dexed by group atmosphere scores, with t 
upper third of the schools and the lower dur 
9f the schools used, and the middle third d 
leted from the sample. 

The correlation 
and performance in hig $ 
schools (Octant T) me a HD. eu À 
that in low group at ^ ! 
V) was ,31 (n = 12) 


— pe 
between principal’s P 
je 


mosphere Schools (€ )cta? | 
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Training Situations 


y aval aviation cadets. Fiedler and Hutchins 
(Fiedler, 1967a) conducted a study of aviation 
cadets (including some commissioned officers 
in flight training) and their instructors. The 
Student pilots were in the advanced course 
Which concentrated on formation flying, a 
rather anxiety-arousing phase of training. The 
cadets were assigned to “flights” of eight men, 
and each flight was under the direction of an 
Instructor team. The performance of the men 
in each flight was evaluated by instructors 
regularly assigned to another squadron of the 
training base. 

Two samples of 16 instructors were tested, 
pug their groups’ performance scores as 
criterion of leadership (Octants I and V). The 
Correlation of head instructors’ LPC scores and 
Performance of their flights was -45 and .17. 
Fiedler (1967a) also obtained correlations be- 
iR the sociometrically chosen men, that is, 

ormal leaders with very little position power. 
EM were sociometrically chosen, all had 
i. Cadermember relations. The correlation 
a "- LI C scores of 22 leaders ana per- 
hs E e ratings was .55 (p < .02) and .28 for 
$505 E and 15. The lower correlations in the 
3 nd set of teams may be due to the smaller 

ange and standard deviation of performance 
Scores in the second sample of teams. p 

Management trainees. A study by Seifert 
(1969) used 14 management training groupe 
In Germany. These groups; each consisting of 

3 to 30 participants, remained together for 
almost 1 year. Similar to the naval aviation 
Cadet study, each group had a team of instruc- 
tors and a head instructor. Each group ao d 
an informal leader, designated as spokesman 
9f the group, but having Ver little poe an 
Power, Similar to the naval aviation fuor S, 
Seifert renorte at the men were under con- 

fert reported tha ety lest they fail. 
Siderable pressure and pid groups as in- 
(Although Seifert treated thes! BG the Seifert 
teracting, all four judges PEN ) Effectiveness 
Study one of coacting EOS. Ve pussd o 
Was rated by instructor S and performance 
Several scales of motivation?" * 
cs of the! 


` aa men. 
Characteristi “een the LPC of head in- 
Correlations bet SEDS6 Tor those 

E j ce was 9 or those 
Structors and pe! it 20 for thse 


. 5 1 os 
With high grouP atm 


forman 
phere and 
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with low group atmosphere scores. The corre- 
lation between spokesmen’s LPC and perform- 
ance scores was .45, regardless of group 
atmosphere. 

Student nurses. A study by Reilly, mentioned 
above (see Footnote 7), was designed to test 
the contingency model. However, the judges 
rating Reilly’s method were divided on the 
leader’s position power. The study involved the 
assembly of 14 groups of student nurses with 
each group required to complete 10 sets of dis- 
cussion problems and examination questions 
posed by the faculty of the school over the 
course of the year. Some of these problems were 
highly structured; that is, there was an answer 
or a solution available. Other problems were 


highly unstructured; that is, the group had to 
ethical issue for which no 
available. Reilly reported 
between leader LPC and 

irrespective of leader 


discuss a case or an 
definite answer was 
a correlation of .63 
group performance, 
group atmosphere scores. 

The leader had responsibility for arranging 
the sessions and seeing that the group's solu- 
tions were typed and handed in to the faculty. 
She also had the responsibility to assign 20% 
of each of her fellow students’ grades. Leaving 
aside for the moment the question of position 
power on which our judges disagreed, or that 
these groups may well have been interacting in 
most tasks, the more important point might 
well be that the Reilly study, as well as the 
naval aviation cadet study by Fiedler and 
Hutchins (Fiedler, 1967a) and the Seifert study 
of German management trainees, is distin- 
guished by having as its purpose the training 
of the group members rather than the per- 
formance of some task that results in an output 
beneficial to the organization. In all of the 
studies, the exercise was designed to be 
beneficial to the individual group member. 
There might well be a psychological difference 
in leading a group for the purpose of benefiting 
the members and leading a group for the pur- 


sAn earlier unpublished study of classroom per- 
formance by Marse (1958) showed negative correlations 
of —.70 (n = 6) and —.36 (n = 12) for physics and 
rhetoric section instructors at the University of Illinois 
where rated student performance was the criterion. In 
that study, however, the class members never interacted 
in groups as part of their training. Whether this is an 
important determining factor remains to be seen. sis 
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TABLE 8 
CORRELATIONAL SUMMARY OF STUDIES EXTENDING THE CONTINGENCY MODEL 
TO Coactinc Task AND TRAINING GROUPS 
Octants 
Group T 
1 | I | nn | IV | v | vi | VII | vill 
Task groups 
Craft shops — 48 908 
Groceries —.06 49 
Hospital departments* —.21 —.32 .52 87 
"Telephone offices fT Ald, 
School principals? — 48 31 
Median —48 —.32 52 87 
Training groups 

Naval aviation 

Chief instructors 45 

A7 
Informal leaders 355 
28 

Management trainees? 

Head instructors 56 —.20 

Informal leaders 45 AS 
Student nurses^ .63 

Medians 45 .50 —.20 


* Studies not conducted by writer or his associates, 


pose of benefiting the organization. Our data 
Suggest that this may be the case.? 

Summary of results from coacting groups. As 
we suggested in the discussion of the Reilly 
(see Footnote 7) Study, the results obtained 
from studies of coacting groups can best be dis- 
cussed by separating results from task groups 
and results from training groups (Table 8). 

The results from task groups are highly con- 
sistent in showing that groups or organizations 
with structured tasks and high leader position 
power perform more effectively under low LPC 
leaders when the group climate is rated as 
favorable, and more effectively under high 
LPC leaders when the group climate 1> Ws 
favorable. Classifying coacting groups in the 
same manner as interacting groups indicates 
that the correlations are quite similar in the 
two octants for which data are available, 
namely, Octants I and V. 1 

Hill, in his study of a hospital, classified the 
supervising nurses? job as unstructured. The 


? Note, however, that a school principal’s ET ee 
bership consists primarily of teachers and clerical staff, 
not of students. 


icted 
correlation for Octant III is in the predict, 
direction, and the correlation for Ostani fo 
while much higher than would be woe. 
this octant, is not incompatible with dat 
interacting groups. 

The data on coacting /ask groups a 
that the distinction between interacting atv 
coacting task groups might be unnecesst 
while the distinction between fask groups tter 
training groups might be meum s Ja 
appear to follow quite dissimilar rules. i 

Phe Seifert (1909), the Fiedler and Hutchins 
(Fiedler, 1967a), and Reilly (see Footnote 
Studies suggest that groups in training nid 
constitute a valid subclassification for le 
ship studies, a classification that needs 49 ity 
intensively examined further, The possibi i 
Should be considered that this set of studi? 
might also include those by Mitchell (1969) a? 


Fiedler which used Participants in leadersh! 
workshops. 


gest 


gery - " l 
Recent empirical Jindings from research ol 


leadership training, Despite intensive effort? 
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improve organizational performance (Camp- 
bell et al., 1970). These disappointing results 
can be deduced from the contingency model. 
Specifically, we can interpret leadership ex- 
perience and training as improving situational 
favorableness (e.g., human relations training 
1s supposed to improve leader-member rela- 
tions; technical training would make the task 
appear more structured). Training and ex- 
perience should then differentially affect the 
performance of high and low LPC leaders. The 
contingency model would then predict that 
training for intermediate situations will im- 
prove the performance of high LPC leaders— 
but decrease that of low LPC leaders. Likewise, 
training and experience for favorable and 
unfavorable situations will improve perform- 
ance of low LPC leaders—but decrease per- 
formance of high LPC leaders. These results 
have now been obtained in several studies and 
further support the con tingency model (Fiedler, 
In press). 
SUMMARY 


A review of studies testing and extending the 
contingency model evaluated the model's pre- 
dictive power in field and laboratory situa- 
Lions and in coacting task and training groups. 

The model seems to predict leadership per- 
formance in field situations, but not completely 
in laboratory situations. The major discrepancy 
between the model and the field studies on the 
one hand, and the laboratory studies on the 
Other, was in Octant II, where the model pre- 
dicted negative correlations, while the labora- 
tory studies showed predominantly positive 
correlations between LPC and group perform- 
ance, This discrepancy, if it is not due to 
chance, may well bring to light important as- 
Pects of leadership interactions that are not 
usually reproduced in laboratory situations. 

A series of studies, extending the theory, Was 
reviewed. Taken as @ group, these studies pro- 
vide strong evidence that the situational favor- 
ableness dimension does indeed moderate the 
relationship betwee? leadership style and group 

"hg and that it therefore provides an 
Deno En r understanding of leader- 
important clue to ou a g of lea 
ship phenome es E m 

Finally, d suggested that we differentiate 
organizations Phat a aoan WC 
between groups 
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benefit of the organization—that is, the typical 
task groups—and groups that exist for the 
benefit of the individual—groups of trainees. 
Coacting task groups appear to follow the pre- 
dictions of the contingency model, at least for 
Octants I and V. Data from groups-in-training 
showed consistently positive relations between 
leader LPC and group performance measures in 
Octants I and III, and only one small negative 
correlation in Octant VI. These findings sug- 
gest a new approach to the classification of co- 
acting groups and training groups which might 
lead to a better understanding of managerial 
performance in task and training organizations. 
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I 
| 
! A refutation is presented to Banks’ argument that the invariance of Type II d^ 
pe II d’) with respect to probability correct 


in recall is an artifact. The replicability of that invariance is discussed, as are its 
theoretical implications. Finally, the meaning of a criterion in the recall situation 


| (or d*, the ““corrected-for-guessing’ 
is discussed. 

4 Banks (1970) has provided a generally ex- 
cellent review of the application of the theory 

Of signal detectability to human memory. In 
Particular, he included an extremely clear 
Synopsis of my finite-state decision theory 
(Bernbach, 1967) for recognition memory and 
Confidence ratings in recall. Unfortunately, 
Banks performed a significant disservice to his 
readers in his discussion of the application of 
the theory of signal detectability to recall. One 

) of the most theoretically important findings in 
this area, first discovered by Murdock (1966), 

is that Type II d’ (or d*, the “corrected-for- 

| guessing” Type II d") is invariant with respect 
to variables affecting memory, such as serial 
Position or trial number (Bernbach, 1967). 
Banks (1970) claimed incorrectly that Mur- 
dock’s result was merely “an artifact of his 
Measuring procedure [p. 89." . 
Three points need to be made, concerning 

(a) the relationship between Type H d' and 
the proportions from which it is derived, (b) 


[ Murdock's finding, and 


the replicability o [ 
? in this context. 


c) the meaning of “criterion x 
First, Banks noted that hit frequencies an 
Correct-rejection frequencies were virtually 
Complementary, and that that was as though 
Subjects, because responses were forced, placed 
Suesses in the lower categories of confidence. 
(The confidence-rating method for determin- 
Ing Type II d’ is commonly used, and it was 
Used for all the instances discussed in this 
Paper.) But how are any of these low-con- 
fidence responses correct, as many are, when 
the guessing probability is virtually zero? In 
my studies using consonant trigrams as re- 
dence dtm tt 

Bernbach, Depar York 14850. 
versity, Ithaca, ^ ew 


the probability of a correct response 


sponses 
by guessing is approximately 1/8000! It is 


more likely that Banks simply observed that 
hits and false alarms tend to covary; the 
theory of signal detectability tells us whether 
or not that covarying is all criterial. (Clearly, 
it is not in recognition memory.) Thus, Mur- 
dock’s finding may have been not an artifact, 
but rather a coincidence. 

If the problem is reduced to the possibility 
of a coincidence, the record can be set straight 
by looking at replications of the invariance of 
d* with respect to variables, such as serial 
position or number of presentations, that 
affect the probability of correct recall, At the 
time Banks wrote his article, two replications 
involving serial position and three regarding 
number of presentations (Bernbach, 1967) had 
been reported. The invariance has since been 
replicated by Bernbach? and by Bernbach and 
Bower (1970). An important point is that 
what replicates is the invariance of d*, not of 
Type II d'. A correction for guessing derived 
from the theory that predicts the invariance 
(as discussed in Bernbach, 1967, p. 476) was 
necessary to obtain the invariances shown in 
my Figure 14 (Bernbach, 1967). It would be a 


robust artifact indeed that would fit into a 
! 


theory in that wa) 
The third point is perhaps most important 


as regards Banks’ conclusions. The criterion 
in question here is not some point above which 
a response will be made; rather, it is a point 
above which some particular confidence rating 
will be made. (In fairness to Banks, it should 


2 Bernbach, H. A. Recognition and recall in short- 
term memory. Paper presented at the meeting of the 
Psychonomic Society, San Antonio, Texas, November 


1970. 
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be pointed out that this same confusion per- 
vades the paper by Murdock, 1966.) Criteria 
can affect the proportion of correct responses 
only if subjects are permitted to censor low- 
confidence responses. There is no criterion 
problem in short-term memory, as long as 
subjects are forced lo make some response lo 
every ilem. This means that Banks’ argument 
against forced recall should be completely 
reversed. What remains may be a problem of 
correcting for guessing (as was done by 
Bernbach, 1967, to transform Type II d’ to q* 
where appropriate). This is not a problem to 
be solved by the application of the theory of 
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1 ss ++ isa problem 
signal detectability, however; it is a probl i 

. oe y 10) 
solvable only by some specific theory 
memory or of the memory trace. 
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The measure d,, a si 
sponses and intrusions in rec 
and validity are raised. 
finite-state theory in recall without d* (i.e. 
ES : Tecoguition test the subject is shown 

a = and some old items (distractors and 
aes Es respectively), and indicates which 
uss x and which are old. Since the subject 
old a "i memory information to distinguish 
d' and new items, a detection measure (Ap, 

2 Or ds; see Banks, 1970) of his ability to 
ae old and new items is properly a 
Whil re of his memory for the old items. 
€ uncontaminated by criterial effects, the 
ee measure will reflect such factors as 
aie ne between old and new items 1n 
tive m to memory. One of the most attrac- 
theor n ps of Bernbach’s (1967) finite-state 
TN rey is that it takes explicit 
nition T the nonmemory factors in recog- 
iion [ , as expressed in the following equa- 

(P. 468, Equation 6): 
v= C= à)d*, [1] 
ure of recogni- 


Wher : M: 
„ere d' is the empirical meas g 
are theoretical 


ton Performance, and ô and d* i 
Quantities, ô being the probability of forgetting 
Ja old item and d* being à function of the dis- 

"iminability between old and new items that 
exists when matching list i tems in 
Memory, 

, An efficient method for o 


tems to 1 


btaining à detec- 
"On measure from recognition is to have y 
Jects rate each item on the recognition ^9. 
Vith their confidence that it is old, ME vum 

€ results to a Type II analysis Hus 
Pp. 82-86) to obtain an Ag or ds 2 en 


i in rec 
the Type II technique '$ used Pom Sm 
ere its outcome has 2 different mean © 


s cognition the 
that found in recognition- M By the ex 

ed by = 
old and new items are provi 


uld be 


hologY: 


sent to William P. 
! Requests for reprints sho Pomona College, 
anks, Department 9 pa 
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gnal detection index of discriminability between correct re- 
all, is discussed, and some questions about 
One of possibly many methods for testing 


its meaning 
Bernbach’s 


, d) is suggested. 


perimenter, but in recall the subject provides 
all the old (recalled) items and new items 
(intrusions) himself and then rates them. The 
d' yielded by a Type II analysis of recall is 
best conceived of as a measure of the sub- 
ject’s ability to discriminate his intrusions 
from his correct responses. By analogy with 
Pollack’s (1959a, 1959b) reasoning in treating 
message-reception data, the Type II measure 
of recognition should be called d, (discrimina- 
bility of stimuli) and the Type II measure of 
recall, d, (discriminability of responses). The 
Bernbach (1971) says is constant in 
d,. When, in fact, detection 
are made which do not use 
ries with amount 


d* which 
recall is clearly 
analyses of recall 
a Type II technique, d' cova 
recalled (Murdock, 1968). 
Because d, depends on the subject to serve 
up both the signal and noise distributions for 
analysis, I have expressed some doubts about 
the measure. In particular, with forced recall 
(forced list-length matching) and a large d; 
there is a good chance that d, will remain 
artifactually constant, or nearly so, as a result 
of guessing habits and the fact that opportuni- 
ties for intrusions are inversely correlated with 
amount recalled. (Banks, 1970, pp. 89-90). 
This artifact, if it exists, is specific to the 
response task, not to the manner in which 
items are learned or forgotten. Bernbach's 
various replications of the constancy of d, all 
use the same response task and therefore 
neither augment nor diminish the plausibility 
of the artifact. Furthermore, the a priori prob- 
ability of low confidence guesses has little to 
do with the artifact. The seeming 1/8000 
probability of a guessed consonant trigram 
being correct (Bernbach, 1971) is actually 1/20 
or less if the subject gave it a low confidence 
rating because he was unsure of one of the 


consonants. 


The question of whether or not to force 
recall is but one issue in a very complicated 
problem. (I did not simply argue against 
forced recall as Bernbach seems to think.) 
This problem has two aspects: the nature of 
the noise distribution in recall, and the speci- 
fication of the “recall criterion" used by the 
subject. The intrusions given by the subject 
in recall form the empirical noise distribution, 
and d, measures the distance between the 
mean of the intrusions and the recalled items 
on the likelihood axis. The value of d, cannot 
be interpreted as a measure of strength or 
discriminability of the recalled items unless 
the nature of the intrusions is assumed con- 
stant at all levels of retention. If the subject 
produces, intrusions more similar to the list 
items as he learns the list better, he will slide 
both distributions up the likelihood axis, keep 
d. fairly constant, and reduce measured cri- 
terion in the process (rated recall usually 
shows a lowered criterion as level of recall 
increases; see Bernbach, 1967; Bernbach & 
Bower, 1970; Murdock, 1966). While some 
questions about the noise distribution might be 
resolved by, for example, using the intrusions 
appearing in recall at various levels of retention 
as the new items in a recognition test given to 
another group of subjects who learned the 
same material the problem posed by the 
recall criterion is not so easily attacked. The 
theoretical distributions underlying d, depend 
importantly on. the criterion the subject is 
assumed to use in deciding whether to emit 
a response (Clarke, Birdsall, & Tanner, 1959). 
Even forcing recall will not ensure equivalent 
response criteria across conditions because as 
recall increases more of the list-length match- 
ing can be taken up with high-confidence 
responses and more low-confidence ones can 
be edited. Although it may seem that some 
ingenious solution to the recall criterion prob- 
lem could be found, equating response criteria 
is not solely a practical problem. To postulate 
a recall criterion dividing strong, recallable 
items from weak, nonrecallable ones requires 
a unidimensional strength theory of recall 
which many investigators would not want to 
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assume. In the context of such a theory; dr 
could be a valuable measure, but it requires 
too many assumptions to be a purely descrip- 
tive index of performance. 

I have suggested that any conclusions draw? 
from rated recall should be supported by addi- 
tional evidence (Banks, 1970, pp. 90, 91). 
Bernbach's theory is not tied to dn, and M 
view of the present doubts about the meaning 
of d, it seems he would be wise to turn t? 
tests of different predictions of his theory ™ 
recall. An obvious prediction, not put te E 
test as far as I know, is contained in Equation 
1. If percentage (forced) recall estimate 
(1 — 6), then the recognition d’ for one group 
of subjects should rise linearly with the rec! 
of another group in the same learning t255: 
The 1967 statement of Bernbach’s theory seem 
to predict, in addition, that the slope 1 
linear function is d* (i.e., d,), but this prec 
tion would probably have to be modifie " 
include a variable expressing the similar! ü 
between old and new items on the recogniti? 
list. 


of this 
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Two-choice probability lear 
formulations of stimulus-sampl 
stimulus-sampling models in bin 
and an attempt is made to doc 
trends are examined in research 
tions, discrimination, and age. A 
to the no-memory, conditioning assump 
subjects approach the prediction task in 
often relying on prior even 

describe these aspec 
framework, is discussed. 


bear 


Es aie over à decade of research in 
“There ity learning, Estes concluded in 1964: 
it is € seems to be little room for doubt that 
of Possible to abstract from the phenomena 
aged learning evidence for the 
fo asive operation of a relatively simple 
tm of conditioning, or associative learning 
ii - [p. 121]?? The form of associative learn- 
f; 8 Estes proposed was based on the now 
amiliar assumptions of Estes’ pattern model: 
(a) the two-choice situation can be represented 
by an assembly of N mutually exclusive 
Patterns of stimulation each of which becomes 
Connected to predictive responses on an à -or- 
, yen basis; and (b) exactly 
E by the eb ud 
ability 1/N; (c) the pro» 
Pattern add es a eal becomes condi 
tioned (if not already so) to the response onn 
lorced on that trial is a constant, Gj. OVER E a 
Estes’ interpretation was immediately p 
lenged by Anderson (1964); who note a 


Statistical models tł assume random sam- 
z gratitude to James 
press her A n 
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ument the direction of current research. 
ing on sequential variables, memory, 


number of studies ir 
tions of several statistica 


ts in the sequence. C 
ts of behavior, often in 
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Jniversily 


the classic situation in which to test 
Failures of several assumptions of 
ailed in this review, 
Recent 
instruc- 


n these areas suggest, contrary 


tion studies are det: 


1 models, that 


an active, hypothesis-testing manner, 
urrent thinking that attempts to 


a cognitive or information-processing 


s cannot handle the 
lming evidence that subjects respond 
s in a binary sequence. Citing 
hedules in which the event 
al on the outcome of 


pling of stimulus element 


overwhe 
to prior event 
evidence from sc 
on trial 2 is condition 
trial » — 1 (Anderson, 1960; Engler, 1958; 
Hake & Hyman, 1953), Anderson (1964) 
pointed out, “Tf subjects have memory traces 
of past events when they receive a conditional 
schedule, then it can hardly be doubted that 
they have memory traces of past events when 
they receiv a frequency schedule Ep- 182” 

To a large extent the evidence since this 
controversy has favored Anderson’s viewpoint, 
and the area has witnessed the decline of sim- 
ple associative no-memory statistical models 
such as the pattern model and its non- 
Markovian precursor, the linear model (Estes 
& Suppes, 1959). Theoretical attention has 
been redirected to the role of memory and 
the subjects’ use of sequential as well as other 
discriminative cues in predicting completely 
random (frequency) schedules and various 
nonrandom event schedules. 

The purpose of the present review is to trace 
the development of research in sequential 
learning from the probabilistic orientations of 
the early 1960s to the current concentration 
on sequential processing. Emphasis is pri- 
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marily on empirical developments as they re- 
late to emerging theories of sequential pre- 
diction. The majority of this empirical work 
has focused on the two-choice prediction 
paradigm in which a subject is required to 
predict with response A;(i = 1,2) one of two 
events (E;; i = 1,2) on each of a series of trials. 
The review, therefore, limits itself to studies 
in this two-choice context, except where varia- 
tions may offer helpful interpretations. 

Theories that have addressed themselves 
to sequential prediction paradigms are of two 
broad types: cognitive and stimulus-response 
conditioning models. Few models in either 
category have sufficient flexibility to span the 
entire range of data that is reviewed in this 
article. For this reason, the relevant. models 
are briefly discussed in their respective empiri- 
cal contexts in order to provide a framework 
for the research. Detailed theoretical exposi- 
tions, however, are beyond the Scope of this 
article. 

Organization of the review has followed the 
direction of research in Sequential prediction 
since 1960, Because several reviews (e.g., 
Estes, 1962, 1964; Weir, 1964) of early work in 
probability learning exist, redundant dis- 
cussion of many works was avoided here except 
when mentioned briefly for appropriate his- 
torical continuity. The topics that are con- 
Sidered fall into the following categories: 
sequential variables, memory, instructions, 
discrimination, and age. A large body of re- 
search relating probability learning to motiva- 
tion and incentive is omitted; the nature of 
this research Suggests a separate treatment 
of it. 


SEQUENTIAL VARIABLES 


Frequency Schedules 


Early work in probability learning (Grant, 
Hake, & Hornseth, 1951; Humphreys, 1939) 
presented subjects with frequency schedules 
in which two events were randomized over 
the entire series of trials with the sole restric- 
tion that the more frequent of the two pu 
a certain proportion of the time (7). In 1953, 
Hake and Hyman introduced conditional event 
schedules in which the probability of an event 
on trial n was dependent on the event m 
rence of the previous trial. More recently, t S 
control of higher-order conditional properties 
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(e-g., runs of events) has become common. AC | 
cordingly, this section considers research on 
these different schedules separately. f e 

Frequency theories. Explanations for ES 
dictive performance with schedules in W F E 
cues associated with prior events pe. 
same correlation with subsequent veinfortue 
events are numerous (Anderson, 1960; B 
Goodnow, & Austin, 1956; Estes, 1959; m 
& Suppes, 1959; Feldman & Hanna, T 
Gambino & Myers, 1967; Lordahl, E d 
Restle, 1961) and often difficult to test ore 
with frequency schedules. Estes’ piaus are 
models are an exception in that they The 
addressed primarily to random schedules. j 
linear statistical model assumes that PO 
a linear function of the response probabit! 
on the previous trial: 


P(Ays,1) = (1 — 0)P(A,) + Om 


-] & n r. 
where @ is a learning-rate parame at 
pattern model, described earlier, pred 
similar learning curve: 


C 
PAg) ( = EPan) +o 


The 


t 
$ 
| 
at f: 
Both models predict P(A) asymptotes ^ 
Predictions of the two differ with reap id 
asymptotic response variance and m aU o. 
statistics that reflect the probability © ina- 
sponse after different response-event gar 
tions, P(A;|E;A;), or after event T varies | 
length 4, P(A;| EJ; i, j, k= 1, 2; VAN, 
from 0 to the total number of consecut" i 
events. Both models predict that the B Y 3 
ability of an A; response should increase Ges | 
successive occurrences of the E; € fairs 
positive recency), but the pattern model. tia 
somewhat better in fits to certain seque tet; 
predictions (Friedman, Burke, Cole, ee 
Millward, & Estes, 1964; Suppes & Atkin 
1960). hat 
Restle’s (1961) schema model assumes t cy A 
subjects base their Predictions on the salien is 
and frequency of occurrence of homogen©? 
runs of length, 7i. The model can predict c | 
tive recency and, in Some circumstances, PO”, 
tive recency. Gambino ànd Myers (1967) ale 
suggest that subjects respond to run cues; ee ait 
subjects are assumed to 
of a given run length 
tional gradient to oth | 


A sation 
Seneralize predicti? E. 
according to a bidire 
er runs along the ru! 


F 


length continuum. This model, among others 
(e.g., Lordahl, 1970), has not been directly 
tested with frequency schedules. 5 

Several theories assume that subjects re- 
spond primarily to the sequence of immediately 
prior events (Anderson, 1964; Feldman, 1963; 
Feldman & Hanna, 1966; Friedman, Carterette, 
& Anderson, 1968; Lordahl, 1970). Anderson’s 
model assumes a subject integrates information 
from prior trials according to a weighted 
averaging process in which the weights are 
independent of outcomes of previous trials 
(i.e., path independence). Theories of Lordahl 
and Feldman and Hannah both attempt to 
define subsets of events that the subject uses 
to test hypotheses. Assessing the subject's 
Verbal descriptions of binary patterns (length 
eight), Lordahl determined 39 unique hy- 
Potheses which placed different emphases on 
the subjects perceptions of runs, symmetry, 
and repeating patterns and so could be clas- 
sified into seven categories. Feldman and 
Hanna (1966), on the other hand, assume that 
Subjects generate and test hypotheses about 
ra event sequence on each trial, retaining 
those that result in successful prediction. Sub- 
Mem are assumed to respond to blocks of prior 
i» comes _ as discriminative states defined 
y conditional probabilities with subsequent 
events, A modified Foulkes procedure (1959) 
Permits assessment of states up to lengths of 
five, as well as predictions about the order in 
Which states are discriminated- 

More general, and less precise than the 
foregoing theories, are the problem-oriented 
ormulations of Bruner et al. (1956). Subjects 
are viewed as engaging in a sequence 9 ge- 
cisions (i.e., a strategy) in order to hoc 
Valid cues in the probabilistic predictos we à 
Three objectives control the subjects, Y oun 
gies: (a) maximizing information ger c1 
each prediction, (6) minimizing zu CU ef 
9r cognitive strain, (c) minimizing 


: :m , d by 
fa : " the difficulty pore y 
Pa p 230), Ger dip Me ee may 


s 3 id cues among runs 
Shift from searching 


o: uin relative ency as à cue to 
sing relative 

problem solution in 
Frequency research. 

present no clear-cut supP' 

but do offer d 

research centerec a 


a frequen? 
Frequency schedule data 


ort for any one theory, 
Ct ties for a few- Because this 
ifficu ( testing predictions of 
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linear and pattern stimulus-sampling models, 
the variables controlled have related to 
theoretical questions about the role of rein- 
forcement, independence of path, and stimulus 
variability. For this reason, variables of con- 
cern to an information-processing approach 
have not been manipulated, and the role of 
memory and cognitive capacity can only be 
inferred. 

The matching of P(A1) to 7 and shifts of 
P(Ai) with v in random schedules have been 
interpreted as support for statistical learning 
models (Estes, 1964), as has the finding that 
first-order conditional statistics rank order 
with P(A) highest following a correct Ei 
prediction and lowest after a correct E» pre- 
diction (Friedman et al, 1964; Suppes & 
Atkinson, 1960). However, the assumption of 
the invariance of @ with 7 has proved inappro- 
priate for the linear model (Estes & Straughan, 
1954), and the discrepancy in estimates of the 
arameter from marginal and condi- 
tional statistics has proved difficult for the 
pattern model (Atkinson & Estes, 1963; 


Suppes & Atkinson, 1960, Chapter 10). Suppes 
and Atkinson, using 7 = .60, found that the 
ted from asymptotic 


learning parameter estima! 
first-order conditional data produced too 
rapidly accelerating marginal learning. This 
suggests either that learning rate is not con- 
stant over trials or that subjects may be learn- 
ing two separate problems, one based on sim- 
ple event frequency, and another based on 
sequential properties. 
Further, the positive 
statistical models has been long known to be 
a sometimes phenomenon. Negative recency 
first documented by Jarvik (1951) and sub- 
sequently by others (e.g., Anderson & Whalen, 
1960; Nicks, 1959; Suppes & Atkinson, 1960) 
is difficult for statistical models. The existence 
of a “the other event is due" phenomenon 
Ge, the gambler's fallacy) with successive 
reinforcements indicates that subjects are re- 
sponding to more than the immediately preced- 
ing event. Attempting to show that negative 
recency is a result of preexperimental response 
biases, Friedman et al. (1964) presented sub- 
jects with 288 trials of a m = .80 schedule 
(randomized over 48-trial blocks) after over 
800 trials of a variable z frequency schedule. 
'The resulting asymptotic positive recency, 


learning p: 


recency predicted by 


156 


often cited as Support for statistical models, 
Was assumed to reflect the true state of learn- 
ing in subjects after removal of the masking 
effects of preexperimental biases with extended 
training. Others (Edwards, 1961b; Feldman, 
1959: Lindman & Edwards, 1961) have also 
reported positive recency. However, lending 
Support to Anderson's (1964) comment that 
different frequency schedules may simply train 
in appropriate recency behaviors is a Study by 
Jones and Myers (1966). Using two 7 values 
(.60 and -75), they compared predictions of 
Sequences in which 7 was randomized over 20 
trials with those ‘in which z was randomized 
over 300 trials, Randomization over larger 
blocks resulted in higher mean and variance of 
run lengths, which in turn produced higher 
P(Aj) values and more positive recency, 
Using a lower r value (.50), Friedman et al. 
(1968) also found little 
recency predictions of st 
with extended training ( 
for 5 weeks). Sequenti 


average model provided dat 
those of Stimulus.s i 


Foulkes analysis Técommended by 
and Hanna, 


the models 
assumptions were ques- 
tionable because the subjects showed differen- 


tial Tesponding to certain subsequences (i.e., 
alternations) suggesting that groups of events 
may be coded together in short-term memory, 
And wide individual-difference weightings 
derived for these subjects indicate that coding 
schemes in short-term memory are not uniform 
with the highly uncertain + = .50 schedule. 

Recent work bearing on the subject’s en- 
coding of random events is offered by Vellott 
(1969). Yellott compared the N-element 
pattern model with a modified two- arameter 
version of the simple linear model under two 
conditions of noncontingent success rein- 
forcement (ô = .80; 1.00, where à is the prob- 
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ability that the event on any trial agrees br 
the prediction of that trial). Several wie. 
frequency schedules @= 2, 5, 8) ane, 
the noncontingent success schedules for Mes 
Subject. Because the pattern model s 
conditioning occurs only on incorrect. br on 
while linear models assume the probability 
conditioning does not change as a funchon 
prior sampling (unless 0 is very large ‘that 
infinite Population), Vellott anticipated Ee 
Sequential statistics under different ES 
contingent Success schedules should d 
nate between the two types of models. alter d 
tion response Proportions after a sequenc the 
successes did not decrease as predicted a dl 
linear formulations, but remained pei 
with both à = .80 and 1.00, thus suppor E 
the pattern model, Unexpectedly, the A ue 
ment model nicely predicted the learning pe 
for noncontingent success (8 — .80) trials v ie 
Parameters estimated from sequential s 
tics, a finding not common with noncon ena a 
event schedules (Atkinson & Estes, 19 C 
However, with ô = 1.00, the observation Ks 
response dependencies (“superstitious Si 
tions”) could not be handled by either mo a 
Because the dependencies typically e 
sisted of patterns or event runs, periocs 
and aperiodic, it appears that subjects aen 
to encode frequency schedules in terms of su 
subsequences. ai 
Related to the finding that the subject oa 
respond selectively to prior event patuere m 
Anderson's (1966) attack on the linear mode » 
description of the stimulus as a fluctuatine 
population of elements, In a series of exper! 
ments, Anderson extended an earlier study e. 
Burke, Estes, and Hellyer (1954) in whic" 
some or all of a Stroup of signal lamps initiate 
i Prediction trials. In a? 


lamps) activated on c. 
Prediction that the d 


PAins|Ey,,) = 


ifference between 
PAS ial Es. = k0, 


umber of signal lamps lit on 2 


where k, the n 
given trial, was varied CIA -.8)as? 


Within-subjects? Variable. Using Severa] va]ue$ 
of c (.50, -65, -75) and methods of grouping 


pe 
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ee or in constant groups of three 
Fa tive eee found little reliable support 
and the seri qiio ws eit . 
= PAn ngil Eon); diia is voa Bud 
usually between consta nd RM 
tondifiens with the pet ane variable signal 
smaller ee ; m atter groups exhibiting 
Stimulus eus g tates; thus, the concept of the 
single ele s as an unordered population of 

But ments appears inappropriate. 
fo s and complexity of binary 
quency s 3 x hich the subject responds in fre- 
scriptions redules seem to render simple de- 
Tesearch | a This is emphasized in 
extrapolated ordahl (1970) in which subjects 
mined owe doge events from randomly deter- 
concluded. th patterns of length eight. Lordahl 
Weighted | Da as many as 40 difierentially 
plain the ny potheses may be necessary to ex- 
to a pre the gad s pattern-oriented approach 
of this diction task. But a direct application 
adigms pi seg to probability learning par- 
sented H ere trials are paced, events pre- 
Peated) enr gee and subsequences re- 
and eiie; d ignore limitations of memory 
subject's ru reinforcement schedule on the 
Potheses Ss apud to generate complex hy- 
that He E d does reinforce the notion 
Subsets or cdi binary sequences idi 
may vary ee the complexity of the coce 
Finall with task parameters. 
Pects Ww research on the perceptual as- 
the stin probability learning casts doubt on 

ree fo eer AE. assumption that 
each til es a stimulus element or pattern en 
alors al and that this is conditioned in E 
Millward 4 fashion on some trial. Rabe eed 
Quenc rd (1968) presented subjects with E 
md Schedules (r = .80) in wma. bs 
or 2 S responses were required TOF aus 
SAEN fast-observation trials 
abilit i In 500 subsequent E ams 
5 Dd learning trials (4 seconds Pe nificantly 
u on groups did not diffe? asymptotic 
«appropriate control groups in asymp 
Estonce rosor ons ordin rub statistics. But 
in5 proportions Or in run 97. tional on 

of the "Hle statistics CO! : 

Outco he 16 possible sta "^: 11s, observation 
grou as of four previous oe unfortunately, 
he Ips did differ significant | ""Mirection the 

he authors do not describi the" i 
difference. DO despite tbe fact that they 
Occurre Mona Men patterns (e.g. 
d after relatively § 


a sub- 


alient 
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double alternations). However, the general 
comparability of results in the fast-observation 
situation with those in the standard probability 
learning situation lends support to the Reber 
and Millward contention that there is a 
cognitive component in sequential learning 
that the conditioning assumptions of statistical 
learning theory ignore. 

In summary, extended training does not 
unequivocally result in the positive recency 
predicted by statistical learning models. 
Recency effects are a function not only of 
initial expectancies but also of the particular 
frequency schedule, with high m values and 
randomization over large blocks of trials en- 
couraging positive recency. With lower 7 
values, extended training may eliminate nega- 
tive recency responding for some subjects, but 
tendencies to rely on prior groups of events are 
apparent in individual strategies—findings 
also not predictable by either pattern or linear 


models. 
Research on 
that even in this comp 
jects can remember groups of events both 
initially and after extended training; (D) sub- 
jects code the events into patterns of runs and 
alternations, although with frequency sched- 


dules the particu chich the 


lar solution on W 
subject seems to stabilize may vary widely 
among individual su 


bjects; (c) subjects do not 
sample and condition from 


a population of un- 
ordered stimulus elements on 


each trial; 
rather, learning has a large perceptual com- 
ponent suggesting that if active sampling 
occurs it must be a rapi 


frequency schedules suggests 
lex situation (a) sub- 


d covert cognitive 


process. 


First-Order Condit ional Schedules 

Conditional theories. With the exception of 
the simple linear model and the pattern model, 
most of the models described in the previous 
section can be discussed in the context of 
constrained schedules. However, an extension 
of the linear statistical model for discriminative 
situations (Burke & Estes, 1957) replaces 
simple statistical models presented earlier. 
Burke and Estes’ linear trace model assumes 
that one set of elements or traces is available 
for sampling following E1 events, and a second 
set is available after Es occurrences, and these 
traces enter into the conditioned stimulus on 
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any trial. The theory predicts that repetition 
responding, P(A,.u|E;.), on conditional 
schedules should approximate (plus or minus 
Correction terms) the conditional Stimulus 
ifications, P(E; n41| E;n). 
m enis research. Since Hake 
and Hyman (1953), one common means of 
varying the statistical Structure of binary 
Sequences has been manipulation of first-order 
event dependencies, Pisa] Es, ‘ 
holding marginal event frequencies constant, 
Engler (1958) varied P(Ei.a|E;,) as a 
within-subjects’ variable (.25, 40, .75, .90) 
over 120-trial blocks wi 
trials (.75, 


usually 


igi Ein), a 
with Burke and Estes? 
lal dependencies affected 
rate so that when x changed, 
learning was faster if the dependency structure 
remained unchanged, 

The studies of Hake and Hym 
Engler (1958) were followed 
(1960) parametric evalu 
schedules with 7 = 50 


varied from .1 to 1.00 in .1 increments for 300 
trials. P(A,, |E;,) again followed first-order 
event properties exceeding the objective values 
Particularly at higher conditional values, 
Neither the Burke and Estes model nor a 
model that assumes the subjects condition 
either repetition or alternation responses 
could describe the Subjects? tendencies to 
alternate more than the Objective event se- 
quence with lower conditional values, The 
Study not only demonstrates that persistent 
recency effects observed in frequency schedules 
are a function of Sequence structure, but it 
also shows that subjects respond to event 
ternations as readily as to event runs. 
Transfer performance of subjects pre- 
sented with these conditional schedules (300 
training trials) revealed that when shifted 
to a PE; 4|E, y = .50 schedule, subjects 
change repetition responding in the direction 
of the transfer schedule but do not approach 
the asymptote of Subjects trained on this com- 
pletely random Schedule, Only groups receiving 
initial schedules with P(E;,|E;,)- 1.00 


an (1953) and 
by Anderson’s 
ation of conditional 
and P (Erw Eray 


al- 


or 0.00 demonstrated rapid response shifts z 
approximated response proportions of — 
receiving P(E;,,,|E;,) = 50. This i a 
patible either with the notion that S € 
trained on partial reinforcement schedu eS E 
less able to discriminate schedule shifts or ee 
more resistant to extinction than subjec 
trained on continuous schedules. "s 
Other transfer studies (Goodnow & y 
grew, 1955; Goodnow, Rubinstein, & Lu d 
1960) have used first-order conditional data 
tions to manipulate run structure. The M 
Suggest that similarities of training and toot 
Sequences in terms of run lengths determines x 
direction of transfer, This finding is relevan ue 
Witte's (1964) Study in which P(Einsl Bi 
Was .65, .50, or .35 for three groupsin a eee 
Schedule with z = .50 for 300 trials. Over 5 
transfer trials, where P(E niil Ern) d 
for all groups, repetition responding mene 
50, a finding consistent with the Burke des 
Estes (1957) trace theory but not with Res the 
(1961) model. Neither model could handle ir 
run statistics in transfer, In fact, the st y 
data revealed long-lasting effects of ea in 
training; subjects who saw many long gun 
training still showed positive recency over vet 
transfer trials. Witte’s randomization of 7 0 en- 
blocks of 300 trials in transfer may bave UN 
couraged such a result. Such randomiza nt$ 
would produce more long runs of both jit 
than a Sequence in which marginal pants 
was more constrained, thus making bw 
and transfer Sequences relatively less Em ' 
criminable in run Structures for subjects p 
P(Ec 4| E; n) = .65 in training and so St 


Ham ese 
taining the early response patterns of th 
subjects. 


To summarize, Subjects’ repetition respond 
ing is roughly related to first-order sequent 
Properties ip that P (Airnya Eim) art” 
follows but exceeds PEinalE;,,). But t 
dependent variable does + 
the subject 


M o ints: 
* Conditiona] restraint 
es 


„ole 
f de Play an important 10 
In mediating transfer. It is fc por 

the trace-cond) 


A a 

us or this reason ts 

ming theory has l 

' ace idit 7 > has proved 

adequate, finally, But Restle’s assun! tion th 

Subjects rely on accurate memories f po 
'"s 0 


— —— 


"m 
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d 18 us inadequate. RE slightly over- 
Engler (1958) and. responding for data of 
grossly a dm Goodnow et al. (1960), but 
Tecency Miei i ori di responding and 
Unie rves in Witte’s long-term study. 
T si es ed these conditional analyses 
ing emm y negative evidence for condition- 
ing i offering. few clues to promis- 
Seed to a e" formulations. Schedules de- 
ieunes dag assumptions of conditioning 
chological ay Ages so obscured other psy- 
well as dis Ver mien that the experimenter as 
Processes su ges must guess what is going on. 
Srütture ops iis hypothesis testing of run 
extreme j^ t s exist, probably operate at an 
certain a a in these relatively un- 
to assume mer The solution, however, is not 
testing exi at no memory or no hypothesis 
stances b "yr in these complicated circum- 
ess I fon is control such processes first in 
plex sequences. 


llighey. 
igher-Order Conditional Schedules 


dp to isolate the relative importance 
predictive even subsequences 1n probabilistic 
of higher- tasks have led to more direct control 
theories ra sequential properties. Relevant 
earning all into three categories: (a) simple 
ition ed that assume the subjects’ con- 
Certain | her to prior event sequences or to 
&M We f n subsequences: (Gambino 

ple frac 1967; Vitz & Todd, 1967); (b) 
ade Cir e or cognitive-processtn 
1966: M runer et al., 1956; Feldman & Hanna, 
ürchic: yers, 1970; Restle, 1961); (c) hier- 
ical models that assume the subject learns 


or : 

T perceives certain subsequences initially and 
1 forms of the 
1967; 
1969; 


sim- 
ssing 


& 


» Wei arebinski 
1965), eel d “all these models 
aye made relatively 8 assumptions 
pus sequential prediction, ir 
pto Sons of the models are 
ee tests are often spec 
S of sequential constral 


ific to particular 
nts. All assume 
some way to 


that 

the sects respond ! P 

Sroups or oe aei n e of events na sug 
: subsequences OF fes as units is 


jects n MS eT 
S respond to subsed E : 
demonstrated in a studY of Ludvigson (1966) 
3 i è extinction as a 


* H ^i 2 
who examined resistance to 
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function of pattern length (1, 3, 7 successive 
homogeneous events) and number of pattern 
repetitions (3, 5, 7 repetitions). Number of 
repetitions of the pattern unit, and not pattern 
length, increased resistance to extinction. 
Direct manipulation of event runs in a pre- 
diction task resulted in sequences G= .75) 
composed primarily of run lengths less than 5, 
lengths between 5 and 11, and lengths greater 
than 5 (Derks, 1963). Derks confirmed that 
negative recency effects are specific to se- 
quences composed primarily of short runs and 
that gambler’s fallacy responding disappears 
with experience with long runs. This finding 
indicates that the processes determining 
recency responding under various types of 


schedules are similar; the degree of control 
ith random and 


over such processes is less wi 

first-order conditional schedules. Control of 
the mean and variance of the distribution of 
runs has further clarified recency effects. 
Gambino and Myers (1966) combined high 
(1.5) and low (4.5) mean run distributions 
factorially with three levels of run variance 
(low, medium, high). Increasing mean run 
length and variability increased repetition 
responding and positive recency. 

This behavior seems consistent with run 
theories such as Restle’s (1961), but in 1966 
Restle revised his schema model as a result of 
irect test. He constructed two sequences 
.50) with run lengths of two and five 


(r = 
only, with P(E;| EE) either .50 or .286 in 


nts. In a second experi- 
ment, he compared four run-length combina- 
tions: (1-4), (2-3), (2-8), (6-9). In both ex- 
periments, responding at uncertainty points 


(e.g., after two consecutive events in the first 
as found to be a function of the 


n lengths, not the ratio as his 
Restle suggested instead 
that subjects either discriminate run lengths 
and match to the conditional probability of the 
next event at uncertainty points or they fall 
back on first-order conditional information and 
simply use the preceding event. Such an ex- 
planation can account neither for the over- 
shooting of first-order conditionals (Anderson, 
1960; Engler, 1958; Gambino & Myers, 1966; 
Witte, 1964) nor for the observed imperfect 
responding at certainty points (Gambino & 
Myers, 1966; Restle, 1966). 


one of two experime 


experiment) w 
sum of the ru 
1961 theory implies. 


160 


MARI RIESS JONES 


Gambino and Myers (1967) tackled these 


inaccurately 
run termina- 
to other run 
ncertainty, a 
results in de- 
native ability, 
h Restle's data 
from variations 


Myers, 1966). Increased vari 


restimate longer 


g everative errors), 


Support for 


combinations, breaking down 
ily with Sequences having many single 
alternations. But the estimated learning-rate 
parameter again failed to be invariant across 
conditions, and y did not vary as expected with 
the mean and variance of the run distribution. 
Typically, errors at certainty points were a 
function simply of the number of run con- 
tinuations between the two run terminations 
with the 1-5 sequence producing more errors 
than the 4-5 sequence. Repetition responding 
at uncertainty points was not affected by run 
variance, but decreased Systematically with 
decrements in the proportion of long runs. 
Two related questions confront the run- 
length models: Is the assumption that runs 
are the Primary discriminative unit correct? 
What types of relationships among these units 
are learnable and how are they learned? 
Regarding the first question, several studies 
already reviewed testify to the potency of run 
cues. Another demonstration is offered by 
Halpern, Lantz, and Schwartz (1969) in which 


PUD: Cues were pitted against tonal cues as 


event predictors. 4 
fectly correlated with event occurrences, E 
certainty being introduced by ew 
discriminability of the two cues (A a 
= 0, 5, 9 deviations in decibels [sound ee. 
sure level] from a standard of 70 decibels) A 
six different run-length conditions me m 
Sequences and structured Sequences pep. A 
mean and variance of run length with v = Pe 
The presence of run cues decreased pedia en 
accuracy in the structured sequences 1 
auditory cues were highly discriminab e. 3 
Sequences were random (randomization HE 
cedure unspecified), the subjects relied 9m aci 
criminable auditory cues and were more fea 
curate than subjects experiencing e. x 
sequences at points of uncertainty in be to 
quence. It appears that subjects learr a 
attend to run cues with structured gee 
and cannot switch attention at points in 
Sequence where run cues are unreliable. dto 
But to what extent can subjects atten a 

reliable sequential cues other than A 
Strub and Erickson (1968) and Erickson @ " 
Cuchural (1969) examined subjects’ ability e7 
use second-order conditional information. Th D: 
controlled marginal and first-order event ne 
tingencies (.50) using two levels of second-or 72; 
conditionals CLE; E; s] = .92 or i 
5 f= 1, 2). In both studies, increased i 
curacy reflected the subjects’ use of the high?" 
level contingency, but with hneopntingye 7 
levels subjects tended to fall back on the pd 
vious event evidencing repetition responding: 
The fact that the run structure covaries ig 
second-order conditionals makes it difficult, E 
the authors pointed out, to determine to whee 
extent the subjects are indeed responding t0 ¢ 


single cue, two trials back, or to resulting ru? 
patterns, , 

Wolin et al, (1965) Presented three group? 
9f subjects with binary Sequences generate 
by different. Sources such that the sequence? 


contained subsequences of Simple patterns: 
Rules that insured accurate predictions could 
be derived from the various subsequences! 
these were of the form “never an E, after a? 
E? mber of Eys betwee” 
two Ess,” ; - Subjects tended to Tes 
produce Sample subsequences, responding pu 
marily to Yomogeneous runs rather “than 
Statistical rules, However, there Was son 


: ; I- 
Auditory cues were pe 


pr” 


evi 1 s 
ed s rp were responding to 
Prodi trials ermiued run lengths (i.e., switch 
a a, continue on even trials). A second 
binary "€ introduced a parallel between 
st vo eiie i and AAB) and “words,” 
random M rules generating partially 
substitutes fo contingencies thus serving as 
Sencies defini 3 EHURTAGOH rules. Contin- 
ROR ome ae i were rapidly learned, as 
rules, sabia foe rules. With more difficult 
tequencies ia "icc inii by matching “word” 
“words.” ^: not accurately anticipating the 
verbals nis no case could the subjects 
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oa Vemm contingencies studied include 
àn event dan m m which the probability of 
(Young 1967) ; as a function of trial number 
ules spannin 2 aon complex conditional sched- 
1966). moe trials (Feldman & Hanna, 
plex Peers and Hanna generated a com- 
Foulkes vase sequence using a modified 
1,000 et ar a Subjects’ predictions over 
spond to tint trials indicated that they re- 
defined by ONDE well as run contingencies 
tive states eoe properties as discrimina- 
Which sik eie dee model. But the order in 
unction of st; p the states, which is some 
andled by t n cpi was not completely 
Nose of-ths aut hors descriptive model. 
Problem of - studies directly confronts the 
Criminative etermining the primary dis- 
Sübject ue unit, if one exists, used by the 
varied i dit suggest that while subsequences 
ment ws 3 ificulty for subjects, the reinforce- 
the ae 3 side itself strongly determines how 
Predict n used different subsequences. 
recall e is paradigms involving pei 
to this, &y have greater potential for an ? 
Ti assume with Gambino 9 $c k 
Tod g others (Restle, 1961, 19675 Wee 
à d, 1967, 1969), that as à first approxima 
jects d oe pa ene runs is a 
andi b mak expend s pot generality. 
icap for a theory striving ^5, varied th 
Butler, Myers, and Myers (1969) varied the 
Proportion of ti ye One length was 
repeat. on of times a run ee 50; 78) in 
fires ` successively 5 d 5 Son components 
(2: ^ wu dienn T ancy G ae 
nificz -6, 2-8) Run tition responding at 
cantly affected repet! 
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uncertainty points such that with 4-6 and 2-8 
combinations, repetition responding was mini- 
mal for the ņn — .75 condition and maximal 
for y = .50 condition, suggesting that these 
subjects were responding to runs of runs. As 
with the Myers et al. study (1969), persevera- 
tive errors did not decrease with decrements in 
run variance; 7 did not affect perseverative 
errors, while run-length combination did. 
Groups receiving the 2-8 sequences made more 
perseverative errors than those receiving the 
4-6 sequences, who in turn, were less accurate 
with the longest run length than the 2-3 groups. 
This, as the authors pointed out, suggests that 
variability as defined by the difference be- 
tween the two run lengths determines per- 
severative error responding. 

Limitations of such a model are put in 
clearer light by a study of Jones and Erickson 
(1968) in which mean and variance of run 
structure were identical for two groups, but 
one group received a recurrent pattern of run 
lengths, while the other received randomly 
arranged run lengths. Contrary to Gambino 
and Myers’ predictions, subjects responded to 
sequential regularities, particularly when such 
runs followed the longer run lengths. This 
finding agrees with earlier work of Rose and 
Vitz (1966) in which rules applicable to one- 
half of the trials in sequences (x = .50) 
generated different patterns. Patterns of the 
same length were responded to more ac- 
curately if they represented simple modifica- 
tions of the run structure (e.g., E:E1E1E1E2E2) 
than if they were complex (e.g., Ei. Ei: E2Ei:E2E2). 
In some conditions, subjects learned run-run 
contingencies, and, often these regularities 
followed the longer run lengths. 

A modification of the run-generalization 
model that permits the learning of run-run 
contingencies assumes (a) with probability 
c, a subject will learn a particular run-run 
pairing on a given trial, and, with probability 
1 — c, a subject will retain the wrong run-run 
hypothesis; and (b) generalization occurs only 
after predictive errors (Myers, 1970). The 
model predicts that in a 100% noncontingent 
success schedule, subjects will exhibit only 
riodic solutions of run lengths. 

Myers used a noncontingent-event-training, 
noncontingent success transfer design to test 
this prediction. Two levels of y, the probability 


pe 
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run repetition, were crossed with two training 
Sequence conditions (2-5, 4-5). Periodic solu- 
tions in transfer were not observed; rather, 
subjects tended to generate solutions based on 
combinations of remembered run codes. The 
findings were interpreted in terms of a limited- 
capacity information-processing model in 
Which higher error rates are attributable to 
more complex and taxing solutions, The fact 
that subjects in the 2-5 sequences generated 
"more complex? solutions in the noncontingent 
Success transfer and also showed higher error 


rates in training was interpreted as Support for 
this view, 


direct manipulation of sub. 
) subjects re- 
degrees to the 
ns of homoge- 
patterns most 


increase perseverative 
at uncertainty 
ith mean run length and 
Us Sensitivity to run-run 
redundancies in the se- 
similar lengths of sub. 
earn contingencies based 
on simple run modifications more readily than 
complex contingencies; (e) subjects rarely 

i Atistical rules that generate run 
Contingencies, but rather describe sequences 
1n terms of run combinations, 

Neither the Restle nor the Gambino 

Myers theory can handle these data ade- 
quately. Restle’s theory, because it does not 
Permit a confusion factor, cannot handle the 
Subjects’ decreased accuracy at certainty 
points or Systematic changes in repetition re- 
Sponding at uncertainty points. The Gambino 
and Myers model fairs better, albeit in some 
cases with inappropriate changes in Parameter 
estimates, But, the Gambino and M vers iden- 
tification of generalization with variance of 
run structure has Proved of only limited value, 
for it appears that either the range of run 
lengths or the number of run continuations 
between terminations is more critical in deter- 
mining responding at certainty points. Since 
range, number of runs, and variance have often 
been confounded in tests of the model, it is 
difficult to pinpoint the relative effects of the 


and 


different variables on error responding. a 
addition, the identification of the leto 
parameter with incentives and SES M 
and not with run combinations, is troublesoag 
in view of the fact that some run ne 
tions (e.g., 2.3) are disproportionately eas 
to learn than others, . Jr 
The data suggest that a confusability € 
is important in explaining sequential pré 
tions, but it is more likely some function 0 st 
number of different run codes a person m 
process, rather than of statistical variabi a 
of the run lengths. The number of ape Í 
subject may identify can, in turn, be a p = 
tion of the discriminability of the run er 
ponents. In sequence conditions character E 
by a few continuations between pae 
lengths (e.g. 4-5), it is likely that, at sf 
initially, a subject may be relatively succes ing 
both at certainty and uncertainty points v. 
one code ("it's 4 or 5”) to predict both dem 
lengths. That is, the subject does not ca 
mediately recognize two distinct run lengt ns | 
But in sequences with many continuatio” 
between the component run lengths (e-8» 
1-5), the subject is more likely to aitetip s 
two-code description of the sequence initia = 
Increasing the number of codes a ee ne 
must remember or organize thus may BEES 
his ability to attend to the ongoing sequenc z 
In the vein of limited-cognitive-processing | 
| 


theories such as those of Bruner et al. (1956) 

or Myers (1970), taxing cognitive "xp 

increases error rate. Myers assumes that ae 
number of run codes a subject must proce h 

hypothesis complexity aN 

that with high complexity the subject is mo! 

likely to lose count of the current run and guess 
| 


about the next event with probability equa 
to PEs 4| E;n). 
reflects the fac 
redundancies i 


: : e 
increasing th 
Proportion of 


r run repetition? 
Pending at uncertainty, 

its, not on counting accuracy, But Myers 

additional Suggestion—th, v \ 

very dissimilar run compor 

to cognitive Strain by Encouraging the subject 

to generate more "complex? hypotheses— y 

raises the problem of gauging thee s city 

of cognitive hypothese in 


» ith 
at sequences wit 
nents also contribute 


S that are ba. he 

a ase h 
same number of Fun Codes. Ate ee on d 
ain hy 
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potheses more complex because they result in 
higher error rates? Another solution to speci- 
fication of a relationship between independent 
variables and cognitive complexity is to 
assume, as previously suggested, that cognitive 
complexity is simply a function of the number 
of codes the subject entertains at a time, and 
this is determined by the discriminability of 
component runs and the number of irrelevant 
hypotheses the subject retains. This approach 
not only avoids scaling the subjects’ reported 
solutions but also allows the possibility that 
certain run-run contingencies permit elimina- 
tion of irrelevant hypotheses and thus reduce 
Cognitive complexity. In deterministic sched- 
ules, this potential generality becomes an 
advantage. i 


Deterministic Schedules 


Clearly subjects can learn simple sequential 
patterns; this was known long before Schoon- 
ard and Restle (1961) demonstrated experi- 
mamy that subjects quickly become pro- 
ficient at double-alternation sequences. Rather, 
the relevant questions are: (a) How does the 
Subject learn the sequence? (b) Are the psy- 
chological processes that control the subjects 
Performance in deterministic sequences similar 
to those operating when subjects predict fre- 
quency and conditional schedules? Do these 
Situations evoke entirely different processes a$ 
Estes (1964) suggests, or can psychologists 
examine processes relevant to probabilistic 
Predictions in the controlled environment of 
deterministic schedules? Perhaps, as Bruner 
et al. (1956) insist, commonalities such as 
Similar criteria exist at a higher level, which 
Systematically unites apparently different 
Strategies. 

Of the theories that describe predictions of 
deterministic schedules, few have espoused a 
simple conditioning approach using the run 
stem as a discriminative stimulus for the sub- 
sequent response. An exception is the model of 
Vitz and Todd (1967), which is an extension of 
Bower’s (1961) one-clement paired-associate 
model. While fits of the model to error dis- 
7p Diltions for different patterns (lengths of 6, 

» or 12 events) were fair, the model’s assump- 
tions that all stimulus patterns are equally 
difficult to learn was inappropriate, for con- 
tinuations of a given Tun Were more readily 
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learned than alternations—a rather common 
finding even in nondeterministic schedules. 
Moreover, the learning rate varied with pat- 
tern length, the longer patterns being learned 
more rapidly. 

More common are formulations that assume 
the subject proceeds in some step-by-step 
manner to construct a representation of the 
entire sequential pattern (Reber, 1967; Restle, 
1967; Simon & Kotovsky, 1963; Vitz & Todd, 
1969). Such models vary in their emphasis on 
mechanisms such as rule learning. Both Hake 
and Hyman (1953) and Wolin et al. (1965) 
have argued that with conditional schedules 
subjects do not attempt or succeed in learning 
the statistical rules that govern the sequence, 
but rather they tend to reproduce subsequences 
of events in order to anticipate coming events. 
In deterministic schedules, where rule struc- 
tures remain constant over subsequence pres- 
entations, it is reasonable to expect data bear- 
ing on this issue. But too often experiments 
may be interpreted in support of either position. 

Early examples of this are studies by 
Galanter and Smith (1958) who attempted to 
show that subjects identify recurrent regulari- 
ties in sequences and construct a linguistic- 
type model that is isomorphic with these 
regularities. Using auditory inputs, Galanter 
and Smith presented subjects with learnable 
cvcles of binary digits. Cycle complexity 
(number of different runs), not cycle length, 
determined difficulty. Subjects reported sud- 
denlv recognizing the regularities, construct- 
ing a model, and testing it. Randomness, or 
stimulus interference, is assumed to be a factor 
that prevents adequate model construction, 
and Bruner et al. (1959) demonstrated that 
even two random disruptions in a simple 
pattern (EE1E:) seriously degraded model 
construction. Analysis of error distributions 
within the sequence in light of possible rules 
would have clarified descriptions of subjects' 
reports. Such sequential analysis bears on the 
simpler and perhaps more appropriate ques- 
tion at this stage: How and in which order 
does the subject proceed to derive rules or 
construct à model? For if model construction 
exists, the mechanics of it are disappointingly 
vague. à ; 

A promising line of investigation into per- 
ceptual processes involved in model construc- 
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tion comes from Garner and Gottwald (1967). 
Subjects were trainedon asimple (EXEAE;E;E;) 
anda complex (E;E;E;E,E;) repeating pattern. 
In one condition, subjects began at any point 
in the pattern; in a second, subjects Were re- 
quired to begin at one of the five different 
Sequence positions, thus yielding five groups 
Predicting different initial versions of the s 
pattern. Analysis of the subjects’ pattern 
descriptions revealed preferred modes of 
Organizing the events, and these methods relied 
heavily on the use of runs and alternations as 
importantly, 

a function of 


ame 


the subject's initial expectancie: 
alternation) about the Sequence. The simple 
pattern was easiest to learn when Started at 
the onset of the run of three, one of the two 
preferred descriptions, Subjects learned the 


adily if the arrange- 
ha single-alternation 
ed in an arrangement 
eliminated this expectancy. 
is is the finding that subjects 
Were generally most accurate in predicting the 
event following the longest run. This suggests 
that disconfirmation of dominant expectancies 
toward alternations, which is supplied by 
long runs, is a Potent source of information 
for the subject. Longer runs apparently played 


Strated similar trends, 

Multichoice paradigms offer some flexibility 
in further examining the mechanisms of 
pattern learning. Restle and Brown (1970) 
Presented subjects with four forms (initial, 
inverted, transposed, and inverted-transposed) 
of two six-choice serial patterns. Error profiles 
reflected the subjects ability to abstract sys- 
tematic Properties of the sequence such as 
trills (i.e., E;E;E;), a finding difficult for condi- 
tioning theories based on single-event chaining, 
serial position, or Subsequences of prior events. 
Significantly, a major source of error was 
attributable to expectancies the subject ac- 
quired about the Sequence structure. 

The parallel, Suggested by Restle and Brown 
(1970) among others (e.g., Wolin et al., 1965) 
between learning a sequential pattern and 
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learning to understand a language E 
Chomsky, 1968) is a useful one. A Pd ae 
its application is the substitution ks oí 
analysis of sequence structure for analy M. 
psychological processes that may bea ww e 
of syntactic structure. In this regard, P 
(1967, 1969) has elaborated the percep M 
learning approach suggested for henian 
schedules (Reber & Millward, 1968) w a 
deterministic schedules, In this context, ra 
ceptual learning is of an abstract P n 
matical” structure, and according to -— 
is “implicit,” Tepresenting a primitive i 
requiring no higher-level intellectual fac d 
Just as a child may learn to speak and ux 
Stand a language before he can learn — 
grammatical rules that determine his "— f 
a subject may learn to use a n ian ale 
complex way without evidencing formal ke d 
learning. Using artificial grammars with ‘hat 
letter vocabularies, Reber (1967) showed t a 
with randomly constructed letter seguen is 
subjects’ learning curves reach an at 
stable asymptote, while subjects presen re 
with different “sentences” generated by ru 
continued to improve. With similar teda 
Reber (1969) also demonstrated that trans i 
Proceeds along lines of grammatical structu! 
primarily and is not vocabulary specific. - 
Implications of this finding are pin uud. 
If elementary patterns such as runs are con A 
bined according to rules, then the resulting 
regularities of these patterns (Le., the gram- 
matical structure) may be more critical to the 
subjects prediction mastery and transfer than 
the simple patterns themselves. Just as run 
Structure defined by 
transfer in probabi 
order rules or regulariti 
may mediate tr: 


g in defining aspects 
80r similar processes- 
StanCy of pattern length oF 
d-members could signal the 
and facilitate over all applica- 
regularities (Fritzen & Johnson» 
mplies that cues or expectancies) 
bject may use initially to identify 
Signal perceptual units tha; must 
analyzed more carefully usin£ 


of the patterns by countin 
Cues such as con: 
discriminative en 
end of a chunk 
tions of the 
1969). This i 
Which the sul 
regularities, 
finally be 
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ue coding devices, such as counting. Thus, 
unting may be specific to inputs in a later 
stage of learning. 
vos us dos iden to this is offered by 
coo ia (1969). They assume that sub- 
of iie n ; primarily to homogeneous groups 
ünits pes then proceed to combine adjacent 
fashion vs binary case) in a hierarchical 
complexit e. Garner's (1962) terminology, 
be Een, hw binary (or trinary) patterns can 
ated over i D sum of uncertainties? evalu- 
series of *s T oen of the coding process. A 
trinary a with repeating binary and 
jap Cems, (Vitz, 1860; Vire & Todd, 
could be ae that the complexity measure 
plexity uds to predict recall accuracy, com- 
descriptive gments, and mean number of 
measures PN independently of response 
sania wes co the model is nonmetric, 
sponding that the subjects’ first level of re- 
applied i perceptual, the model cannot be 
ever, if it sequential prediction data. How- 
Pattee i San be assumed that any gram- 
first in Pon sequence is approached 
AE Fi manner, and secondly in a metric 
promising p: this parameter-free approach is 
highly "ii he complexity measure correlates 
measure y Royer i and. Garner’s (1966) 
The Vitz a PE eed difficulty (r — 93). 
to handle d Todd technique has the potential 
first-orde the relative contributions of m and 
tingenci T contingencies, as well as con- 
tion } €s among runs, albeit each contribu- 
On has equal weight. But because the tech- 


ni ; ight. - 
ag is not metric, it would have difficulty 
learning based on contingencies between 

re or less 


Fe SNUUS that might be mo 
Berge heec in terms of lengths, à 
hmon in binary sequences. 

Other experiments with deterministic sched- 
ules are relevant. Both Keller (see vitz & Todd, 
1967) and Derks and House (1965) found that 
errors are a function of the number of runs in a 
Sequence, not sequence Jength—findings com 
Patible with Vitz and Todd's interpretation. 
Derks and House also reported that given the 


circum- 


* Specifically, 
k 
Hia” x Hi 
arner's terminology, 


j G: 
where H; = Hmax + Hioit "f, els of code analysis. 


in 
and k = final level of several le 
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same number of event runs, homogeneous run 
sequences (sequences in which runs were of 
the same or similar length) produced fewer 
errors per response than heterogeneous se- 
quences with mean run-length constant (4 
or 8), a finding also consistent with the com- 
plexity measure of Vitz and Todd. Thus, this 
uncertainty index does reflect the notion, use- 
ful in both semirandom and deterministic 
sequences, that complexity is, in part, a func- 
tion of the number of difierent run codes with 
which a subject must deal. 

If, as Vitz and Todd suggest, the run-encod- 
ing mechanism is a primitive perceptual one, 
then the existence of responses in a prediction 
task could conceivably disrupt encoding and 
constitute a source of error. Others (Bruner, 
Wallach & Galanter, 1959; Garner, 1962, p. 
287) have argued precisely this, but Reber and 
Millward (1968) seem to suggest that the overt 
response has little or no effect on responding, 
at least with randomly arranged events. Ex- 

erimental evidence is conflicting. Bruner et al. 
(1959) found that with a simple repeating 
pattern (ExExE2) subjects not required to 
make overt responses performed better than 
those who were. This was not supported by 
Garner and Gottwald (1967) with patterns of 
length 5. Using even longer patterns of runs, 
Derks and Larson (1969) found that subjects 
who observed the sequence took significantly 
fewer trials to learn six binary patterns than 
subjects who made prediction responses or 
than subjects who merely counted the events 
in the runs. These authors suggest that where 
overt responses interfere, they interfere more 
with the subjects’ organization of run-length 
orders than with perceptions of run lengths. 
In this regard the data may also be taken to 
indicate that subjects do not count events in 
runs initially, so that forcing subjects to count 
at this point in learning is just as distracting 
as making a prediction response, thus retarding 
initial perceptual learning and, so, learning as 
a whole. But the failure of Garner and Gottwald 
to obtain interference remains unexplained. 

In contrast to perceptual learning ap- 
proaches are the models of Simon and Kotovksy 
(1963), Feldman (1963), and Restle (1967). 
The differences between the latter theories and 
perceptual learning approaches are in assump- 
tions about the mechanism of learning. All 
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three assume that the subjects" approach to 
learning a pattern is Systematic, proceeding in 
a logical hypothesis-testing manner. For 
Simon and Kotovsky and for Feldman, sub. 


mandatory rules that 
lengths in an all-or 
locations in the s 


1965) describes opti, 


the subject Proceeds from 
an initial Suessing state to 4 hypo 


are presumably 
the way runs combine, 


' groups on the 
2E3E;) after 72 


, but all 
Sroups showed roughly the 


y rule locations were fit nicely by 
arkov model using the same 


Stimates for data from all five 
Ocations, but the three-state formulation pre- 


dicted only two of the four Optional locations, 

The model offers useful Ways of conceiving 
of Sequential learning if Parameters can pg 
more clearly linked to meaningful experimental 
variables. "The probability of error in the 
intermediate State for optional rule learning, 
for example, might vary as some function of 
the number and type of mandatory rules (or 
runs) in a Sequence. In addition, the model 
implies that all mandatory and all optional 
rules are of equivalent difficulty. The data of 
Vitz and Todd (1967) and Royer and Garner 
(1966) invalidate this assumption for manda- 
tory rules, while Restle's own data discredit its 
application to Optional rule learning. Lacking 
specification of variables that might determine 
important differences in rule difficulty, or se- 
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quence complexity, precise prediction dd 
subjects’ order of rule learning is imposs! Ee 
But the type of model that permits descripti i 
of error distributions across sequence es 
has a. distinct advantage in mue pora 
over the Vitz and Todd approach in which t 


ie 5 ifficulty 
entire pattern is simply given a difficulty 
index. 


In summary, 
sequences are n 
conditioning 


Predictions of pim 
Ot accounted for by v 
assumptions whether based ing 
Position. Perceptual d 
Some support in structu ch 
mediated transfer and TOxpongo-ia tto Teo" 
Studies, but the mechanics of such eva 
ing remain vague. More complicated pr^ 
have combined proposals of hypothesis bae 
ing with hierarchical processing assump em 
(e.g., Restle, 1967) Suggesting several bee 
Stages of learning and permitting Xs 
theoretical predictions, Although Restle’s SS 
dence and an analysis by Derks and Hot "d 
(1965) support such discrete-stage ipee 
lions, these notions remain essentially hollo s 
until effects of experimental variables sudi 
sequence complexity are tied to learning 
Parameters, 

Specification of the role 
ture at different learning 
offer a means of combinin 
assumptions with 
testing axioms. 
Proaches seem 
emphases on se, 


of sequence poe 
Stages may, in ies 
8g perceptual Le dir 
more formal hypothesi 1 
At this point the two SE 
to place relatively pce 
quential properties at med 
learning Stages. Hypothesis-oriented meore 
(e.g. Myers, 1970; Restle, 1967) maintair 
that subjects initially acquire an unorderec 
catalog of run codes by means of counting, 
and that higher-order Structural constancies 
Such as run-run contingencies may not affect 
this lower-order learning (Myers, 1970). Ac- 
cording to a Perceptual learning approach, the 


Subject attends initially to higher-order struc- 
tural 


Constancies, and these rules would 
mediate component learning; counting 15 
Specific to | 


ater Stages 
Some index of levels of Sequence Structure, it$ 
regularities, and its relationship to either 
Ypothesis testing or Perceptual learning would 
be useful, 
In thi 
(1969) 


of learning. Clearly, 


S respect. the 


index of Vitz 
represent 


- 7 and Todd 
S à useful beginnj 


DS for at- 
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ter : 
only the a ew. The measure reflects not 
Sequence m n pel hig run codes contributing to 
salient a but also other potentially 
event freq e tid properties such as marginal 
the LT 2 Some justification for linking 
as pieniiivs pes ha psychological concept such 
Correlation POENI derives from its high 
Organizations 1 E of learning and 
tions in gen 2. pt Despite some limita- 
for dados. "d (e.g., the lack of allowance 
metric x Bee of number of coded runs with 
"yg tat it offers a basis for 
With the oon sequential complexity increases 
if the Sübiect ber of different runs present. Thus 
binationg. th Tests hypotheses about run com- 
ject must Se number of hypotheses the sub- 
complexity p^ also increases with sequential 
tations ae ur thermore, assuming some limi- 
Been subjects cognitive capacity, 
and hypot? dar require much hypothesis testing 
Cognitive S revision would tax the subject's 
Would su iu d Clearly, such an analysis 
quency i me that the highly complex fre- 
erent” run ee, in which a variety of dif- 

Place maximun are haphazardly presented, 
while semira =" cognitive strain on the subject, 
with fewer E on and deterministic sequences 
in fewer hy os are less complex and result 
are less iiei heses, Thus, the latter sequences 
notion that Xn result in cognitive strain- The 
a function i sea complexity increases as 
Subject must te ne number of hypot : 
size a hyp test and retain appears to empha 
higher-or] pothesis-testing process. me 
perceptu: * Structural regularity so Gaisa B 
ble with ü ‘earning theorists is not incompaty 

a vehicle E approach if structure is dun 
eliminate zar permits the gubjes E. m 
combinati irrelevant hypotheses 2 ac deem 
allow M DES. Similarly, initial oP | pem 
Potheses . subject to operate with Dyer 
ses about the run patterns, and BO ea 

the subject 


heses the 


mav M x 
a Suide the way in which is ul 
“ends to ; 5 her words, struc- 
s to a sequence. In oth 5 
ons ancies deter- 


] expect 


tur, 
so, in effect, may 


ep uarias and initia’ 
eda y pothesis saliency, an 
„uce cognitive strain. —. ain then may 
he reduction of cognitive c lective at 

be accomplished by the gubject’s 5€ pen at- 
tention to certain sequential cues as deter- 


mined either by expectancies or higher-order 
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regularities. Strategies apparent with schedules 
of different complexities can be understood in 
terms of mechanisms that permit a subject to 
avoid or reduce cognitive strain. A subject 
may weigh certain aspects of the sequence 
differently than others and so eliminate some 
hypotheses. Initially, the subject's attention 
is governed by expectancies or dominant hy- 
potheses, and if sequential properties are 
congruent with these expectancies, the sched- 
ule's properties will be rapidly mastered. In 
deterministic schedules, rapid elimination of a 
dominant hypothesis facilitates the subjects’ 
learning of the pattern by permitting the sub- 
ject to shift attention to potentially useful 
structural constancies, such as longer run 
lengths. The pattern itself can serve as a mech- 
anism of attention that provides serial integra- 
tion of cues and so permits the subject to 
efficiently eliminate irrelevant hypotheses, a 
ossibility suggested by Neisser (1966, pp. 
233-234). The long run lengths, by invalidating 
certain initial expectancies and guiding atten- 
tion, may serve as anchors about which the 
subject can systematically attach other run 
this integration process. Such a 
is impossible in nondeterministic 
the subject cannot eliminate 
heses systematically by using 
ments such 


codes in 
strategy 
sequences where 
irrelevant hypot 
overall structure, or recurrent ele 
as long runs, to reduce cognitive strain. Here, 
attention focuses on different cues to ac- 
complish this. Initial expectancies are apparent 
in these schedules, primarily in the form of 
negative recency and alternation responding. 
But disconfirmation of dominant hypotheses 
in these schedules is often neither rapid nor 
consistent. However, in semirandom schedules, 
where reinforcements consistently define the 
run codes, the subjects eventual reliance on the 
current run length is clear. In frequency sched- 
ules, the asymptotic stability of repetition re- 
sponding and idiosyncratic solutions reflect the 
subjects focusing on recent events and certain 
salient run patterns. This analysis suggests 
that the subject tests only the limited number 
of hypotheses to which he can attend and that 
the nature of these hypotheses are determined 
by expectancies and constancies of the rein- 
forcement schedule. Reinforcement then be- 
comes an instructional agent for an attention 
process, and the potency of a negative rein- 


"- 
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forcement may bea 
disconfirmed hypo 
which it is disconfirmed. 


Processing 
Bruner et al, (1956) 


uncertainty 
plexity, but the conditioni 


quency schedules Permitted the 
simplifying assumption o. 


f equal-samping prob- 
abilities of Patterns associated with prior 
events (e.g. Atkinson & Estes, 1963), but it 
also ran the tisk of obscuring Psychological 
Processes such as hypothesis testing. The 
evidence that subjects remember and test 
certain types of stimulus elements and selec- 
tively ignore others has accumulated, but often 
with the aid of more controlled 


Concerting facts for 
approach. From the 
ject’s performance over a Vi 
it is apparent that a major r 
Which i i 


of the sub- 
ariety of schedules, 
evision of thinking, 
8 assumptions as well 
of number and kind of ele- 
ject encodes, is preliminary to a 
complete model, Precise relationships between 
response Strategies and the number and nature 
of stimulus elements remain to be Specified, 
ut research that acknowledges the require- 
ment that stimulus Specification is important 
may ultimately prove more useful than that 
Which makes little attempt to link model pa- 
rameters closely to stimulus Properties, 


Memory 


Conditioning models cannot explain memory 
of events Prior to the most recent one, but 
evidence from the previous section reveals 
that in many cases Subjects do respond, often 
selectively, to Prior sequences of events. 
Models such as the Gambino and Myers model 
can handle the effects of Prior events on learn- 
ing by assuming the true run length is the con- 
ditioned stimulus on cach trial. Errors are the 


io 

result of generalization of responses T iM 
ditioned run length, not the result of fa tion 
of some memory mechanism. But the eal 
of a span of memory is clearly compatib pn 
information-processing theories in — nare 
ory stores of limited informational capaci 1968; 
hypothesized | (Atkinson & ange e run- mg 
Myers, 1970). In line with this are th "e. 
coding models that assume subjects can €! 
| 


imite 
tively increase the capacity of a ET 
memory-store by coding individual Pealen 
run chunks, a notion consistent with J Thus; 
original (1956) chunking hypothesis. ealing 
recall studies have the potential of ee. to 
(a) how the subject codes event sequen Cubs 
Store in memory, (b) how many on "fect 
ject can store, and (c) what variables xu 
Storage of these codes. In assuming a met sup- 
Store notion, however, Some evidence to ning 
Port this interpretation over a conditio 
view must be examined. 


Recall M. emory 


Reber 


st 
and Millward (1965) set out to tes 
the 


no-memory assumption of simple em 
tioning models by requiring subjects to iS Is 
an event xy (y = from 1 through 9 pic 
earlier in a frequency schedule. Correct E TS 
Was highest for the first three or four ae 
but. subjects instructed. to recall over ent 
trials back were Poorer recalling more rec 

events and overpredicted r. 
Eested the Possibility th 
volved in rem 


in a random Sequence 


ai 
STET. | 
forced these subjects t ! 


; ent 
mg on the more mar 
the first to contradict n 


ment with remote Events could be attributed 
to the number of runs in the n—x span- 
Memory was limited to four runs, and when 
the Subject did not recall the remote event, hi5 
was determined by ee 
: the Subjects memory of a frequen, 
azan (1969) confirmed 
do chunk in these situations- 
0, and three widely space probe- 
» Subjects reported betwe 
te Subsequences of Events, 


recall trials 
i discre 


$ 
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for M ; 
called chitinks d dh determinants of re- 
Schedule and o 5 acking. Use of a random 
manipulative power i value clearly limits the 
determi power the experimenter has over 
s nants of the subject's recall. 
TERRAE A 
quences with tw he sequence. Using se- 
either one or ve run lengths separated by 
and 2.5 4-7) aree continuations (2-3, 5-6; 
y ons -— with four run lengths separated 
tested sub inuation (2-3-4-5; 4-5-6-7), Myers 
engths in pess recall of the last five run 
€call ac 5 series of 10 probe-recall trials. 
tance E of runs decreased as the dis- 
nuber s the runs increased and as the 
Parallels ^ : un lengths increased. This finding 
Dersever; the effects. of these variables on 
and E errors in prediction paradigms 
b nes additional support for Myers 
and te Senet both number of run lengths 
to inre e between the two contribute 
hile esis complexity. 
eos [e effect of run-run contingencies on 
compariso not been systematically studied, a 
retain a n of the subjects’ relative ability to 
Symbols sequentially presented series of 
pattern ordered either randomly or with a 
jects SÉ presented by Ross (1969). Sub- 
Ordered ained four to five items with randomly 
ordered Sequences, but significantly more with 
codes p^ Sequences. Thus, arrangement of 
suppl either increases memory capacity by 
in ying a hierarchical coding scheme for 
1 Dut, or facilitates retrieval of codes from 
memory, 
p» Tn this respect, two studies of recall of binary 
E ds helpful. Glanzer and Clark Aue 
eight Es subjects with 256 arrange x 
record, inary shapes (.5-second EXPOSES, 16- 
E reproduction accuracy ber E 
Curac response interval. They jot ased 
up io; decreased as number 9 TUroximated a 
single à point where the array 2PPr E 
S'e-alternation pattern- simple run ^y 
Por GSS ca a the curvilinearity re- 
sulting og a bi cts’ recalling single 
alternations 5 the subject? "v pps compared 
i . Schwartz an OSEE. 
recall of 25 of length 84 nd 25 patterns 
of length pene eng sith Glanzer and 
0. In agreement Y 
sSchwartz, B, J., & Lordahh UA Nadal 
sequence Structure on recall. Paptr PXovember 1969. 
of Psychonomie Science, St. LOY 
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Clark, they found that the sequences recalled 
better were characterized by a few runs; in 
addition, sequences with imbalanced runs 
(Le., very long and very short runs) were 
easier to recall. These findings do not com- 
pletely agree with Myers! analysis of recall in 
semirandom sequences in which both the num- 
ber of runs and the distance between the runs 
increased recall errors. Recall of the more dis- 
criminable run codes in deterministic schedules 
was superior to that of homogeneous run se- 
quences. Explanation of this discrepancy un- 
doubtedly rests with the relative role of run- 
run contingencies in the two kinds of sequences. 
In both, the existence of several distinctly dif- 
ferent run lengths may stimulate generation of 
more run-arrangement hypotheses (Myers, 
1970) than several similar run codes. But, the 
constancy of run-run contingencies in deter- 
ministic schedules allows the subject to rapidly 


eliminate the irrelevant run-based hypotheses 


in these sequences. 
The fact that orderin 
ministic sequence can à 
subject proceeds to learn th 
difficulty of predicting that se 
that there may be more an 
orders of encoding the component run lengths. 
And, correlations between organizational dif- 
ficulty of a pattern and its recall difficulty 
(Vitz & Todd, 1969) suggest that encoding 
order does operate on memory. But specific 
mechanisms such as role of long run lengths in 


recall are not clear, often because long runs 
are confounded with the existence of fewer 
r with run heterogeneity. A technique 
used by Lachman and Dooling (1968) for com- 
paring free recall of connected discourse with 
that of randomly ordered words could be use- 
ful in this context. Using free recall measures 


after 2, 4,6, or 8 presentations of the sequences, 
they found that organization of the lists re- 
flected in word-order regularities proceeds 


comparatively slowly with random discourse. 
They suggested that subjects form small core 
units around anchor words to which additional 
units are added on each input trial. Several 
studies (Garner & Gottwald, 1967; Royer & 
Garner, 1966) have indicated that subjects do 
organize at least patterned sequences, and 
many prediction tasks indicate that perhaps 
longer run lengths could function as anchor 
stimuli in these sequences (Garner & Gottwald, 


g of runs in a deter- 
flect both how the 
e sequence and the 
quence indicates 
d less efficient 


runs O 
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1967; Tones & Erickson, 1968; Rose & Vitz, 
1966; Royer & Garner, 1966). 


Memory Aids 


Speculations about the role of memory in 
determining the subjects? Strategies raise the 
question of the usefulness of memory aids that 
display prior events. In concept identi 


ink, 1964), 
performance increased with increases in the 
number of available instances up to a limit, 
Consistent with Posner’s (1965) analysis, en- 
hancement from the display was greater with 
more complex concepts. Few Probability learn. 
ing studies have Systematically examined the 
role of memory aids; many which have dis- 
played prior events (Erickson, O'Hara, & 
Jones, 1968; Singer & Roby, 


€t al., 1965) have not found unequivocal sup- 


aids. In part, 
Y characteristics 
; in part, it may be 
Sed were too complex to 
any enhancement 
ting plays 
& demonstration of this might 
be clearer in controlled sequenc 
rect learning Strategy 
some points in the Sequence, 
Using semirandom Sequences with runs of 2 
and 6 or 5 and 6, Butler (see Myers, 1970) 
crossed three display conditions (no display ; 
Current run display; 12 immediately prior 
events display) with two instructional condi- 
tions (neutral; informed of run lengths). Ac. 
cording to Butler, if errors at certainty points 
in Semirandom Sequences are the result of the 
Subject's failure to remember the length of the 
encoding it (Le, at its 


Current run at the time of 
termination), then a display of that run should 
eliminate errors and so equate the normally 
Poor performance on 2-6 sequence with that of 
the 5.6 Sequence. According to the run- 
conditioning model, a run display group should 
not differ from the other two display groups. 
The results revealed an interaction between 
current run display, sequence condition, and 
instructions with the 5.6 sequence resulting in 
more perseverative errors than the 2-6 for 
those subjects instructed as to what run lengths 
the sequence. For subjects 


would be present in 
uninformed about Tun structure, performance 


rence 

Was, as it usually is, worse on the pie fun 
than the 5-6 sequence with the Su short- 
displayed. These results suggest pin oan 
term memory mechanism for. cur licting 
length undoubtedly operates in p! ory ai 
semirandom sequences, but that amem ading 
cannot compensate for nondiscriminate uences 
of runs such as might occur with 5-6 pm is 
where the Sap between the two run ned con- 
small. This Study represents the firs „store 
clusive evidence favoring a MO oat a 
interpretation of sequential responding 
run-conditioning approach. eno 

In summary, results from both bat subs 
displays and Probe studies suggest pedes 
jects group event sequences into coded ore ap 
and that the subject's behavior is russi 
Propriately described by information-pro condi- 
models than by event or md edules 
tioning assumptions. For frequency pam ‘ined 
the nature of these chunks is not deter "lihiood 
but memory-aid findings point to the Ue jectS 
of the subjects’ encoding run lengths. pre- 
can recall up to three or four immediate 4 of à 
ceding events, and four or five chunks ou T 
long series of randomly scheduled events. seul 
number of possible run codes in a gearen 
important in determining recall ability; unh 
runs are all of the same length, sequences "phe 
more runs are more difficult to recall. or 
relatively greater number of hg c 
arrangement rules the Subject must emer 
with complex sequences undoubtedly Pc 
tributes to the subject’s memory impairm' d 
in these cases, Work in related areas supp? re 
this notion: Denny (1969) found ihe ns 
complex the Problem, the more likely is pe 
effectively reduced for 
introduction of 
permit the Subject to e er- 
hypotheses, and so improve his recall. Det 


Nis 2 : atten" 
ministic rules may direct the subject's att 
tion to modes of org 


ment that enhance recall. The relative dil- 
ficulty of different a 


m [^ 
Trangements on recall m- 
the arrangement rules themselves and co! 


Ponent run codes is not known, but approacht? 
Such as Reber's Suggest Superior recall of rule? 
and these might Mediate transfer of individu 
Tomponents, ther reserach (Domino wskh 
1963) this expectation. 


- - hat 
contingencies ae 
liminate irreleva 


v pan gen 
‘nizing the run arrang 


965 supports 


bein 


Finally . 
ES — the subjects to recall prior 
facilitate an Tequeney . schedules seems to 
Subjects vel dee ak. strategy in which 
event. eh By on the more frequent 
is the result Ra Millward suggest that this 
his is consi z memory overload or strain. 
more Dore u with the view that in the 
attend and wl de schedules, subjects 
ase respondi A only highly salient cues and 
Plicated tiie ng on these cues in an uncom- 
Where subjects re In less complex sequences, 
Subjects? str s recall three or four chunks, the 
tempting eic suggest the subject is at- 
and on tines accurately predict both onsets 
ministic eie of runs, but only in deter- 
Strategy, edules is this problem-solving 
Y completely successful. 


en INSTRUCTIONS 
tom ne d learning theorie 
with cure handicaps as ot 
Statistical je to instructional variables. E 
Suessing Sere theory presumed “neutral” 
room for en Riga in which there was ample 
Point of vie -instruction. From a theoretical 
ables taises d the study of instructional vari- 
instructions the question of what process do 
indeed elimi affect. Do “neutral” instructions 
eschew oa. this process, or do they merely 
Potent effe rol of it? Instructions clearly have 
Corporates cts, yet no model adequately 1n- 
Structure instructional set into its formal 
Clearly fo so that few proposals have been 
ather EM in instructional researc: 
Not as jar structional effects are often treate 
but a rmal aspects of a theoretical process, 
E 8 epiphenomena. 
DUNS prediction tasks, the 
as et, regarding instructional T 
research [o m.a disproportionate ey a 
(1954), Flo: an early formulation DY eiil 
approach ood maintained that @ su T atey 
of always P Sate opti t event 
if he is ¢ choosing the more frequ' is truly 
random — that the sequent cQ) for 
example. Morse and Runqus ing ‘their 
own “ray found that subjects crea higher 
ndom" se c tended to Shon iy a 
P(A) th: quence 1 (1961) at- 
1) than control subjects: P? “ies 
tempted to encourage pure optimal strates! 
by allow; age P go ndom 
y allowing some subjects tO evalua 
sequences periodically, but 5" jects 


s have suffered 
her theories 
arly 


paucity 
effects 


te ra 
with sup- 
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posedly more information about randomness 
were not markedly more likely to be maxi- 
mizers. In neither study did a majority of the 
subjects reveal pure optimal strategy respond- 
ing. Flood's notion assumes that (a) subjects 
can and do formulate an accurate concept 
of randomness; and (b) in the face of this 
randomness, subjects will not problem solve 
but will accept the rationally optimal strategy 
of maximizing. It is dubious whether the former 
assumption is valid in view of what is known 
about subjects’ attempts to reproduce ran- 
dom sequences (e.g, Tune, 1965). This seems 
to leave the second assumption virtually un- 
testable. Despite this, several subsequent 
studies have persisted in trying to correlate 
“randomness” instructions with higher P(A1) 


levels. 
A number of these stu 


define the conditions un 
will match response probability with marginal 


event frequency, as well as conditions that lead 
to overshooting. Problem-solving instructions 
(Goodnow, 1955) have been indicated as con- 
tributing to matching, but in Goodnow’s study 
a number of variables (monetary incentive, 
task structure, etc.) covaried with instruc- 
tional variables, so that it is difficult to 
draw unequivocal conclusions with respect 


dies have attempted to 
der which the subject 


to instructions. 

In 1962, both 
hout, and Voss vi 
and instructional set, 
quency sequences (r = .70). In the former, 7 
was randomized over blocks of 50 trials; the 
most important finding was that subjects who 
predicted the color of marbles rolling from a 
box had significantly higher asymptotic P(A 1) 
than control subjects who predicted binary 
events without a box. This was interpreted as 
support for Flood’s hypothesis; it was assumed 
that the box made randomness explicit. How- 
ever, Nies also reported that no subject maxi- 
mized completely, and 75 of the 128 experi- 
mental subjects reported searching for patterns 
of runs. McCracken et al. (1962) randomized 
over blocks of 20 trials in two experiments. 
They found that subjects instructed to predict 
events on a trial-by-trial basis matched r and 
were sensitive tO sequential aspects of the 
schedule (i.e., the evidenced negative recency); 
instructed to avoid trial-by-trial 


Nies and McCracken, Oster- 
aried outcome information 
respectively, in fre- 


subjects 
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analysis or to problem solve for blocks of trials 
exceeded .85 in A, proportions and were not 
Sensitive to sequential properties. The study 
of McCracken et al. Tepresents an improve- 
ment in instructional research over earlier 
attempts to convince the subject of random- 
ness. In this study, instructions were used as 
a tool to direct the subject’s attention to cer- 
tain aspects of the sequential task, Assuming 
attention responded to these concrete direc- 
tives, we can conclude that if subjects attend 
to the trial-by-trial fluctuations of a sequence, 
they may be encoding cues such as Sequential 
patterns or runs, whereas if subjects attend to 
larger blocks of trials, relative eve 
becomes a more salie 


Sponse strategies change accordingly, 


when they ntial properties 
they will no eterson and Ulehla 
(1965) Permitted one Stroup of subjects to throw 
a die for each Prediction, while a control group 
Predicted the same prearranged (m = .67) se- 
quences. In accord with Ni 
group exceeded 7, with 6 of the 21 s 
showing a m 
Beach (1967) c "ng 

that the die- ing i 


ubjects 
£y. Swensson and 
cir interpretation 
duced Subjects to 


al properties and maximize, 
Using Peterson and Ulehl, 


Strated with run curves 
contro] groups th. 
ignoring run str 
emphasizing to 
gambler's fallacy, still was not effective in 
orcing subjects to abandon run-dependent re- 
sponding. The authors concluded that sub- 
Jects who showed a pure maximizing Strategy 
did not do so because “they appreciated the 
inappropriateness of run dependency 1 
282" Other studies instructing Subjects to 
ignore Sequential properties (Lathrop, 1967) 
Support the finding that subjects cannot readily 
do this, i 

In an attempt to directly control the sub- 
jects attention to sequential properties rather 
than eliminate it, Butler (see Myers, 1970) 
informed one Sroup of subjects of the run 
lengths and Corresponding responses in the 
Sequence and gave no information 


about run 
Structure to 


a second group of subjects. The 


: also 
design, outlined in the memory sec editions 
examined three memory-display € that 
and two sequence levels. Butler foo e 
either instructions or a memory aid and that 
anticipatory errors in both sequences Lo. How- 
the effect was greatest early in eme com- 
ever, the instructional effect apt 
plemented the memory displays in x through 
perseverative errors for both sequence t both 2? 
out all trial blocks. This suggests tha memory 
active attention process and ipee a an 
are necessary for appropriate encod 
use of sequential cues. . „d tow 

In summary, those studies b cad con- 
testing subjects’ use of a Kee p m un er 
cept in probability have found t domness 
conditions where a concept of "ran a fe 
must be inferred by the experimenter, p(A) 
subjects will maximize, thus pushing these 
above r. But it is not clear that "ad prop” 
subjects are totally ignoring seme jt 8 
erties such as runs. More important, 


1 cannot 
zpparent that the bulk of the subjects Í 
be forced by i 


having latent 
randomness or 
Flood originall 
attempt to 
rather th 


ard 


n 
In the few Studies that have attempted so 
Subject’s inclinations, it is nae 
ns can affect how the er 
attends to aspects of the prior event sequen » 
and this activity appears to supplement 
Subject’s memory of Prior patterns. o5 
z a) has pointed out that € " 
be instructions to suba ar 
tasks, but clearly o 
y encourage self-instruct 
itself Tepresents a power y 


agent in the face of “neut! 
This T 


that instructi 


ts 


In Probabilistic 
Variables too m 
The Sequence 
instructional 


DU 
al 
Instructions, 


i ; can effectively ser "d 
as instructional agents to direct the subject 
attention to Certain subsequence, of event? 


PROBABILITY LEAR 


and : j 
3 so encourage problem-solving rather than 
gambling behavior 


DISCRIMINATION 


Siete discrimination paradigms are 
bris of ie = the random presentation of 
each eae discriminative stimuli prior to 
Two-choice dran in a series of trials. 
cue type cope may be distinguished by the 
cues are more or 1,2,... R) presented; the 
event UE or less valid indicators of the 
The two-cu mes on the respective trial types. 
Studied bns Situation has been most frequently 
Teviewed in i constitutes the bulk of research 
Early rte section. me 
with ae: as was concerned primarily 
earning Caines theories of discrimination 
Estes, ieee 1958, 1960; Burke & 
and Estes i Suai & Mosteller, 1951). Burke 
elements i or example, assumed that sets of 
(T) ns E the discriminative trials 
of the E conditioned on each trial to one 
ability of a prediction. responses. The prob- 
trials E us on T; trials (m) and on T» 
dintepininae Pi the likelihoods that 
became Penis endently sampled on these trials 
totically, B lected to the A; response. Asymp- 
y, Burke and Estes predicted 


P 
(Ai| T) =(1— w)m + w[ Bm + a- B)r2], 


wher B 
an ia ba proportion of T; trials, and w is 
toane uf. trial similarity representing the 
sets, Son overlap between the two stimulus 
that P(A support for the model’s predictions 
with low JT) eventually approximates Ti 
Potes qn rr similarity was reported by 
Atkinson, ^ Burke (1955) and Estes, Burke, 

Atkinse and F rankmann (1957). 
the -—À observing response model 
fntetieetion of that the subject ignor 
trials. Aili of the two stimulus sets on some 
paying reae oa assumed that if subjects are 
perceive e on a given trial, they may 
The mod r stimulus cue as à unique pattern. 
represent r yields 16 Markov states which 
stimulus s hé conditioning status of various 

ES ets to observing responses and pre- 
diction responses, g 

In Contrast to simple co 
discrimination learning, 
growing interest in the 
strategies (e.g., Massaro» 


raises 
es the 


nditioning models of 
cent theories reflect 
subject’s response 

1969). Massaro 
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(1969) described a three-state Markov model 
in which the subject is conditioned to the ap- 
e, to the inappropriate re- 
sponse, or not conditioned. An appropriate re- 
sponse is defined as a response having the 
greatest likelihood of being correct on a given 
trial. Massaro’s theory, in tacitly assuming 
that the subject evaluates the strategy most 
likely to lead to success in a given context, 
marks a beginning of rethinking in discrimina- 


tion learning. But, in general, discrimination 
simple condi- 


research has been dominated by 
tioning models, and the implications of newer 
notions of response strategies have yet to be 
developed. Unfortunately, theories such as 
Massaro’s have not offered sufficient guidelines 
for reorientation of discrimination research 
which has tended to concentrate on parametric 
investigations of the effects of three variables. 
These variables are (a) vs, the probability of 
an E; on T» trials; (b) cue similarity or w; and 
(c) B, the proportion of T trials. 

A significant amount of the research centered 
about these variables has not unequivocally 
supported conditioning interpretations. Just 
as the complexities of sequential dependencies 
contributed to the inadequacy of no-memory 
conditioning models in simple probability 
learning, discriminative predictions as à func- 

cast doubt on condi- 


tion of B, Tis and w have 
tioning models of discrimination. But in simple 
probability learning, the decline of conditioning 


models has been paralleled to some extent by 
a rise in coding or hypothesis-testing models. 
This is not the case with discrimination learn- 
But inadequacies of conditioning models 
aised several theoretical issues that may 
overall formulation of sequential 
in both cue and noncue 
paradigms. At present no theory with both the 
predictive precision and generality of stimulus- 
sampling theory provides a comprehensive 
alternative framework for these issues. At 
least three; not unrelated, questions may be 
raised: (a) Do randomly arranged discrimina- 
tive cues operate simply as independently 
sampled stimuli with common elements serving 
as the only source of confusion? (b) Does 
reinforcement connect stimulus and response 
a passive trial-by-trial fashion, or 
e as an informative index to an 


-testing subject? (c) Is the 


propriate respons 


ing. 
have r: 
bear on an 
choice behavior 


elements in 
does it serv! 
active strategy 
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concept of task complexity relevant in the 
discrimination paradigm, and, if so, how does 
it relate to cue frequency and similarity? As 
the following sections illustrate, research has 
by no means answered these questions, 


Research with T2 


Two conditioning models m 


ake different pre- 
dictions about the behavior 


certain changes in POAT 


proportion of T; trials was varied (8 = «25. 
50, -75). Even more troublesome fora simple 
stimulus-element conditio; 


ning approach were 


the sequentia] Statistics, The 


Pil T; s T, AL gy 


Cj, k, m, $ = 1, 2) showed that s 
OW T» were more likely to repeat a correct re- 
Sponse with similar adjacent trials, but more 
likely to repeat an error if successive trial cues 
differed, A Strategy shift was apparent to a 
lesser extent in subjects with an ambiguous aT 
cue where similarity in adjacent trials en- 
anced a general tendency of these subjects to 
repeat correct predictions. But if 7o = m, 
Subjects did not reveal such shifts asymp- 
totically; instead they perseverated erron- 
eously, regardless of trial type. 
The complexit 


ubjects with 


Y of the subjects conditional 
responding is difficult for traditional dis- 
crimination theories, Massaro’s model can 
predict sequential Statistics for the symmetric 
Situation (i.e., "ec iru by assuming that 

* response most likely to 


reinforcement. of th 
be correct (i.e. the appropriate response) on 


ro- 
Tin strengthens the likelihood of yee: 
priate response on Te n41, regardless o y» pr 
these appropriate responses are A1 OF t com 
dictions. But even this model canno 3 


EX 
eee 7] Ti 4 

pletely account for predictions where 
= .85; 


i or 
observed behavior in this -— 
Suggests that a more comprehensive et is. 
of the role of cues in sequential pre E that’ 
necessary. In general, the data at whose 
trial types do not serve as neutral BEL E 
elements are automatically conditione em 
reinforcement. Rather, different cues EE 
Signal distinct Strategies to a subject, ES two 
failure to find a distinction between a error 
cues may ultimately result in a simplifie j 
perseveration Strategy. 


he 


Cue Similarity i 


«4 the 
Variations in cue similarity evan 
extent to which the asymptotic TP purke 
P(Ai| Ty) is affected by mə, according to qom 
and Estes. The P(Ai|T;) should eo? ap; 
asymptotically only when there is no ox” j 
9f T; stimulus elements (ie., when w A 
P(Ai| T1) should approach the overall me 
[r = Bri + (1 — 8) 


k y man 
Uhl (1964) tested assumption by 7 


ts 
nipulating the Proportion of common ete 
(0.0, .50, .75, 1.00) in a matrix of lights. Bad 
jects responded to the sets of lights as AE 
more compatible 1963 
Estes’ pattern model (Atkinson & Estes, 


criminability (Halpe: 
Massaro, Halpern, 
varied auditory discri 


2, 3 decibels) with similar values of mı 


e+! 
) and a, (50, .60) and required half b^ f 
lentify trial types prior to ehe 
- Increasing distinctiveness had, al- 
lect of Separating the two marg 
s palthough P(A |^ did not beod! E 
independe ith highly distinctive cuc* 
Complex conditional ^ 
P8; general, the probability of an ^^ 
ive a eS Sreater than after an Aj if | ; 
Successive cueg Were identified as different P) ~ 
the subjec If successive trials had the sam 
Were few inversions jy the typic® 
of conditionals. 


m & 


D 


rank-order 


PROBABILITY 


IPinallv 
e Meme pE al (OR) determined 

a d f n 
Was s cinson’s U-shaped P(Ai| T1) function 
ion parili to the highly distinctive cue situa- 
of audi oreover, increasing the distinctiveness 
bree titory cues (AI = 1.5, 3, 6 decibels) de- 
ceased the likelil i epe. 
Dersever ikelihood that the subject would 
tria] erate a correct A; prediction if successive 

A were different. 

^ majority of these findings appear difficult 


je f is assumed to inc 
conditione ip the subjects leaving an un- 
ability on. state and to decrease the prob- 
after Entering the unconditioned state 
an inappropriate conditioning. While 
Poem describes conditional inversions 
Tp oo. y well in the Massaro et al. study for 
Marginal Ta = .20, it cannot predict changes m 
evels of statistics with mə under different 
Sliinptions = similarity without violating as- 
vari; $ of parameter independence and 

I ance, 
TN nini neither variations in visual nor 
reinforcer stimulus similarity support early 
ing, lp on models of discrimination learn- 
ünition in terms of number of common 


elem * 
Ents ignores the fact that subjects may not 
ap as the experi- 


e 
rento, ments and overl 
Th r does and as Burke and Estes suggest. 
affect. fact that a distinct but ambiguous D 
lore Nie pa strategy on the alternate tria 
Aat Suhe a less discriminable T» cue indicates 
Mdepe; fe do not respond to dissimilar cues 
Shite over 7 levels. This finding is 
ary to the general spirit of a common ele- 


d Simp Conditioning model where decreases 1m 
i arity should result in less, not more, 
two types of trials. 
hypothesis can 


m 


V The ction between the 
hana appropriate response = 
in ae, some, but not all, of the data en 
Cues , convexity of P(Ai| T) with ma when the 
at are distinctive. Unexplained data suggest 
Subjects seek and define a distinct strates) 
Istinctive Tə trials in the absence of ex- 


Peri 2 
menter defined appropriate strategies- 


Pr " 

Portion of T, Trials, 8 
According to Burke and Estes, increasing the 
5 I. (8) increases the likeli- 


Dr 3 : 
l OPortion of T1 irials 
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hood of an Ai response asymptotically: 
P(AiT) = e(yi— T) + (1 — w)m + ems. 


Shaffer (1963) in testing the prediction that 
P(Ai| T; increases with 8 found that only 
P(Ai| Tı) increased linearly with higher pro- 
portions of Tı trials. The P(Ai| T») statistic 
decreased with increases in B. Others (Sum- 
mers, 1968) have suggested that the relative 
contrast of event frequencies on cue trials 
with overall 7 may have determined the sub- 
jects’ responding in Shaffer’s study. 

A parametric study using three values of 8 
(.75, .50, .25) and three values of s» (.85, .50, 
.15) sheds some light on the relationships to 
which the subject responds (Myers & Cruse, 
1968). While P(Ai T3) increased with B as 
Shaffer reported, P(A|| T) was mot sig- 
nificantly affected. Trends in both statistics 
suggest that the 7 — 73 difference resulting 
from variations in B was not critical in deter- 
mining the likelihood that subjects would 


maximize their respective responses to the 
two cues; the most significant finding was 
evidence of a complex summation effect of 8 
and «i (or alternately [1 — B] and 72) in- 
dicating that 8 emphasizes the relative 
differences between cue validities, a finding 
difficult for all current models, including 


Massaro's. 

In summ 
mz have, in ge 
increases with incre 


ary, several combinations of 71 and 
neral, shown that P(Ai| T) 
ases in B. The effect, how- 
ever, is due primarily to changes in P(Ai| Tx) 
for ma = -50 conditions, which is markedly 
lower with low B values. For m: # 50, there 
isa summation effect where cue frequency en- 
hances overshooting of the respective Ti values. 


These findings together with conditional 
statistics are difficult for all discrimination 


models. 


Discrimination research has progressed from 


representations of the stimulating situation as 
neutral sets of elements to more complex 
patterns, and from studying the conditioning 

i prediction response to the condition- 
ing of an appropriate strategy. But, problems 
t isolating the pattern as perceived by a 
problem-oriented subject, and of pinpointing 
the strategy deemed appropriate by the sub- 
ject remain. The relative effects of r; 8, and 
cue similarity suggest that subjects are more 
likely to exhibit distinctive strategies on the 
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two trial types when differences that define 
them are greatest. If 71— v» and cue dis- 
criminability are great, the subject is more 
likely to respond. with different responses on 
the two trial types. But, Paradoxically for 
conditioning models, increasing Cue distinc- 
tiveness can also increase the interaction be- 
tween strategies when one cue is 
with complete uncertainty 


the mı — mis high. 


he continuum from a Subject's point of 
View appears to be defined by cue validity or 


the proportion of elements 
to a prediction response. 
in this context does not deter- 
associations of Stimulus and 


» but rather represents an 
operation that confirms or infirms the likeli- 


hood that the Subject is attending appropriate 
Cues. Because many of the effects of cue simi- 
larity and 8 on response Strategies occurred 
very early in learning, it is likely that these 
variables contribute large instructional com- 
ponents that direct the subject into weighing 
different dimensions of the task as clues to ac- 
cepting or not accepting certain simplifying 
response strategies, The relative superiority 
of Atkinson’s observing response model over 
Burke and Estes’ formulations Suggests that 
Some type of attentional mechanism that per- 
mits the subject to restrict his responding to 
certain more valid cues may be useful in a 
nonconditioning context. 

This raises the question of the relationship 
between probabilistic discrimination learning 
and standard sequential learning. The unifying 
thread of stimulus-sampling theory has been 
broken, leaving much of this research in need 
of justification. Two observations that were 
apparent in Simple sequence learning provide 
a rudimentary framework for drawing some 
parallels, Subjects in simple sequential tasks 
were actively seeking and testing dimensions 
of the problem, The most salient cues for sub- 
jects in that context were usually sequential 
cues, but if these were too numerous and 
varied, the resulting cognitive strain forced 


; > tend- 
subjects to simpler strategies based on at 


«wt ay in the 
ing to other highly salient cues. Similarly, € 
discrimination task, subjects are encou sto 
by the nature of the task to seek -—— 
an apparently twofold problem. pt the 
€vent frequencies on both trial types ma’ idly 
discrimination task difficult and could ac 
encourage simplified strategies based dus two 
frequency. Subjects must, in effect, so Restle 
simultaneous problems concurrently. d that 
and Emmerick (1966) have demonstrate plems 
Presenting subjects with several | gems 
simultaneously eliminates recoding in e sot 
attainment tasks, Related to this is uei 
(1966) finding that subjects presente tasks 
several simultaneous probability lean da the 
of equivalent z; values tend to pepe 
Ti values. Thus, the memory or cog t 
Strain resulting from the subject's ane 
find two distinct solutions to diem 
Problems with randomized event sche ce 0 
could readily force the subjects SUE 
simplified Strategies based on marginal evi 
frequencies rather than run cues. Clearly, xit? 
regard to the important notion of comple: ro 
this interpretation Suggests that the pube vi 
problems the Subject must simultaneously i 
enters into a definition of task complexity. 


AGE les 
children’s behavior seems cte 
sophisticated than adults’, it might be see - 
that the simple conditioning theories o A 
quential learning would be appropriate ae 
younger subjects. Children may not be poe ^ 
with the gambler's fallacy or with active p! ult 
lem-solving tendencies that characterize a vy 
behavior. But as early as 1958, Ross and pe 
found that older children (15-year-olds) do ve 
hibit negative recency in successive pro 
tions with frequency schedules (r = .50, - ut 
Atkinson, Sommer, and Sterman (1960) Pt 
linear statistical models to 4 more formal p^ 
and found that 10-year-old subjects also ion 
play negative recency, contrary to predict? é 
Tests of the linear model with younger child. 
have not always examined conditional sta ae 
tics that would re al negative recency- Jorat 
for example, found t. 
to 6 years tended to maximi” 
nse probability more than 97 
ut no information Was pre 


Because 


Margin 


al respo 
11 


“year-olds, þ sente 
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on nI 
co tape responding. Observation of non- 
ficult. for in poe in itself, however, is dif- 
Probability he linear and pattern models of 
Studies v Sanne In short, developmental 
Ten, sim P that even with younger child- 

ple conditioning models meet with 


ficulties, 


X. 
Dari (1998 has reviewed much of the de- 
chavior; to researc on sequential choice 
concentrat avoid redundancies this section 
Į ates on the two-choice research since 
to build paner; such research has continued 
ormulatio strong case against early statistica 
al ready ns primarily on the kinds of M 
cen s suggested. Alternatively, there 27 
ome support for the analyses advanced 
$ hits in his 1964 review. Weir assume? an 
differen in probability learning with age re = 
Ypoth ces in rate of development of CORID T 
bene Oni the one hand and xr 
Crepan ing ability on the other. TEM Te 
etwe cy accounts for the U-shaped relati : 
en age and problem-solving perünusme : 
ma] M is defined as selecting the opk 
anal eee Much of Weir’s review, 2 ua 
hic] = applies to the three-choice situation 
P à child is reinforced a certain percentage 
nobs cry (e.g., 33%, 66%) on. v a é ie 
nm. nd never on the remain" Uh a m 
lia a Weir has taken is usett 
ing some trends in two-choice un. " 
inus information-processing Se sug 
Mente by Weir ties in nicely wit 
à S in thinking about sequen 
197 edults (e.g., Bruner et 2» Pooh 
770). More recently, Bogart? (e ae 
üs formalized a limited-capacit y wer x 
proach to binary prediction of 4- an a i 
Vene which has much !n Lam 
Be Tal information-processin? a 2 
{tz assumes that children, apply Sif, either 
> alternation or epe z^ 
M epar event trac. E to the trace of 
_ le child was attending R 
re prediction qm raise several 
id together, ice [c [ie ee 
«Is th Ush ed age nction, bes ati 5 
randon T "e T ed Hd ree-choice Si ue ions 
Specifi T V > situations? | (0) Do 
€ ildre only to i so, how oes vila 
Wiese rules, 7. ge and other independ- 
Pplication vary with °° 


956; 
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ent variables? (c) What other, if any, age- 
specific processes determine the subject's re- 
sponse to binary frequency and nonrandom 


schedules? 


Frequency Schedules 


Goldman and Denny (1963) presented boys 
(ages 5 to 15 years) with 150 probability learn- 
ing trials (r .75). The median percentage 
A, response asymptotically was 86% for the 
youngest age group; remaining groups matched 
7. Craig and Myers (1963) reported more de- 
tailed research in which three groups of children 
varying in age from approximately 53 to 14 
years predicted sequences with 7 = .60 or .80 
for 200 trials. The youngest group under- 
estimated 7, but the older children (ages 
9-10 years and 13-14 years) matched 7. More- 
over, the older children demonstrated negative 
responding while the youngest group 
tended to alternate responses, particularly 
with m = .60. This finding has been replicated 
by Offenbach (1965) for the 53-year-olds with 
z values of .60, .75, or .90, but Offenbach also 
reported that 9-year-olds underpredicted these 


m values as well. 

The apparent incompatibility of these find- 
ings points up the importance of protocol 
analysis. An excellent study by Derks and 
Paclisanu (1967) helps resolve such conflicts. 
Using the most extensive age range to date 
(3 to 25 years), they presen ted eight age groups 
ith 200 binary events (m = .75). Subjects 
ith marginal response probabilities less than 


recency 


Ww 


w 
66 (two standard deviations from T) were 
labeled undershooters; those with P(A) 


4 were overshooters. Derks and 
it a majority of 3- to 45- 
year-olds were overshooters and a majority of 
5. to 6-year-olds were undershooters. The 
of probability matchers increased 
and the number of subjects 
demonstrating negative recency increased with 
age, with the first appearance in the 12-year- 
old group. Thus, previous studies with slightly 
different combinations of overshooters would 
have different group marginal statistics. These 
authors relate the findings to Weir’s notion 
y young children fixate on a single re- 
sponse, while slightly older children can 
organize response output more complexly but 
are not always able to attend to or process 


greater than 8 
Paclisanu found the 


$- 


number 
steadily with age, 


that ver; 
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environmental events to synchronize response 
patterns (such as alternations) with sequence 
patterns, 

The existence of specific response 


in 3- and 4-year-olds has been drar 
demonstrated 


patterns 
matically 


by Jeffrey and Cohen (1965) 
with noncontingent success Schedules (5 — 1.00, 
50, or 33). In the 5 


= 1.00 condition, where 
any response the subject made wa 


as reinforced, 
the 3-year-olds perseverated responses with 


all but one of the 
- to 5-year-old Sroup showed Strong response. 


It is interesting to 


Seneous ones offered 
ar circumstances (cf. Yellott, 
related areas (e.g., Hodge & 
Tune, 1965) has amply demon. 
people behave nonrandomly when 


are absent. It 


Xt that particular 
expectancies, which vary with 


the initial 


Y suggests that these sub- 
Jects were not capable of adjusting their re- 
rding to incoming error informa- 
tion when it is presented in random non- 
contingent Schedules, With older children and 
adults, errors Signal the need to adjust the re- 
Sponse strategy; younger children reveal not 
only simpler response tendencies but less 
Sensitivity to errors as cognitive signals, 


Nonrandom Schedules 


Bogartz (1966b, 1969), noting the biases 
Specific to children, as well as their limited 
attentional Capacities, suggested that traces 
of the prior response or encoded event, if at- 
tended to by the child, will evoke an alternate 
prediction response (i.e., the alternation rule) 
in single-alternation sequences. Bogartz further 
postulated that if the child is not attending to 
either the event or the response trace he will 
guess randomly. Assuming a one-trial memory 
capacity, Bogartz describes a Markov process 


for each subject determined by two independ- 


demon- 

ently estimated parameters. He Poe mean 
strated remarkable success in ert. run 
performance accuracy, number 0: atial statis” 
of a given length, and other sequer mance of à 
tics for the single-alternation a iem 1969); 
group of 4- and 5-year olds a arameter 
despite variations in individual p 
estimates (1966b). 1 led B 

Implications of the trace mode wy inter 
and Pederson (1966) to vary intertria .subjec 
for 4- and 3-year-olds as a reside Hn 
variable (3 and 10 seconds). Subjects 
single alternations with a 10-seconc put sl 
interval were significantly poorer, accuracy 
responded with greater-than-chance a carry 
The authors concluded that M esign 
over 10 seconds! Usinga within salient (tot? 
and long and short intertrial caen y^ 
trial times were 16 and 8 seconds, gas 
Bogartz (1969) found again that pen ees was 
accuracy on single-alternation cem An eX 
poorer with long intertrial interva pe 
tension of the trace model that assu! SEE. 
probability of entering a null trace ie. cou 
monotonically with an intertrial intervé 
predict much of these data. y «inward 

Bogartz’s concept of children’s jiffere” 
attending" is useful in describing alway 
response-based tendencies that do not ‘In ex 
relate to the ongoing event Seen aetna 
tending this concept to other than quU chil- 
Sequences, Bogartz (1969) assumed the 


ogart 
zals 


the 
pem ased on 
dren may use a repetition rule imn, unt 
i M H s p 
pror event or response trace, as we [che 


alternation rule, depending on the nature alt ] 
Sequence or the stage of learning. At etm n 
the relationship between the subjects se Sieb y 
of a rule and Sequence properties is not fict in 
Specified so that it is difficult to 1 Sd b- 
advance, except intuitively, what rule the ofi 
ject will apply in any given sequence. A see the 
difficulty involves the failure to specify +f. 
circumstances in which a subject e ree 
ferentially respond to stimulus traces an 2 
Sponse traces, since the possibility that pa 
9 not always use the same rules for the p- 
ces is suggested by Bogart ces 
*S applicable to response tra 
age-specific response biases, then 5: 
experimentally examine differentia] rule a 
Plication in, for example, 4- and S-year-0l0* 
nonalternating event sequences shoyjq prese? 


a 


= Sie x 


— m 
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iie picture of the relative contributions 
I imulus and response traces. 
ELO regard, Jones* compared the relative 
a "iac of 3-year-olds and 5-year-olds 
CE nang pe = .1] and repeating 
found th im .9] semirandom sequences and 
on the xs Koth age levels were more dependent 
itas A sequence, despite the fact that 
childrer e ‘Ronpralerred sequence for the older 
atemat More importantly, for the 5-year-olds, 
n fepe lon tendencies interfered with learning 
that EE sequence, a finding that suggests 
Wär jae es specific to response traces or "in- 
applied tention _ may interfere with rules 
attend ^ the stimulus traces. Thus, failure to 
Some ] the stimulus trace may be correlated at 
Ero with regression to age-specific ex- 
fespnnse i which represent rules applied to 
$ Bae traces, a correlation suggested by 
gartz (p. 386). 
aa responses to other nonrandom 
1968) have been reported by Offenbach 
varied ks Myers and Myers^ Offenbach 
With t (E:2| Ex) (0.00 or 0.50) with v = .75 
là he groups of children (75 to 12 years). 
Me leyal could pick up these contingencies. 
ages: ees Myers used four age levels (mean 
9. > years, 7 months; 7 years, 8 months; 
A a 6 months; 11 years, 7 months), pre- 
or ma each with run lengths of 1 and 5 only 
x ag of 4 and 5 in which the probability (8) 
fai. longer run was either (5 or .2. At cer- 
Men points, errors decrea d ae 
PME ga points, repetition responding d- 
> esd with age. All groups showed EEES 
poo RR contingencies, the 4-5 en ent 
Ero 8 relatively easier. Even the rus ds 
Seo Up exhibited negative recency 1N pe em 
5 luence with 8 = .2- Although the perc Fa 
Wia Subjects showing negative recency e b 
8 as not reported, the finding suggests tha t - 
“ervations of negativ y appearing irst 
at age 1967) are specific 
age 12 (Derks & Pa 
9 frequency schedu there! axe a 
Steater number of differen a lengths. Be- 


sed with age; at 


e recenc 
clisanu, 
Jes where 
t rur 


iti babilities on 
Jones, S r frect of repetition pre 
childrens! S. J. The pet j Unpublished doctoral 
liket., auem of Massachusetts, Amherst, 
1968, ion, Univer?" P v i 
M ds pi A deve lopmental 
Mens M v Myers: J ane za 
stud ers, N. A eure in binary prediction. (Tech. 
ep. x sequence eB . University of Massachusetts, 
1068. No. 3), Amh 
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cause no 3-43-year-olds were sampled, the 
possibility that Weir's U-function is specific 
only to random schedules remains. 
Alternatively, Myers and Myers (see 
Footnote 5) suggest that such findings are 
compatible with the Gambino and Myers 
theory that people respond to runs as 'im- 
perfectly discriminated cues. Their contention 
that age differences are attributable to dif- 
ferences in capacity to discriminate among run 
cues is reasonable, but remains to be formally 
tested. Moreover, as research on sequential 
variables has indicated, many data apparently 
harmonious with run-conditioning models are 
also consistent with run-encoding assumptions 
of information-processing models. Experi- 
mental distinctions between à discrimination 
process and run-code confusion may prove 
difficult. 
Thus, in the Myers and Myers study the 
fact that even 5-year-old subjects showed 
negative recency with the 4-5 sequence can 
also be interpreted as support for the notion 
that children can encode sequential events into 
run codes if the task is not too complex. Just 
he number of run codes in a 
sequence may tax the cognitive capacity of 
adults, the greater number of apparent codes 
in the 1-5 sequence here may have taxed the 
more limited capacity of children. The rela- 
tively better performance of children with the 
4-5 sequence supports an earlier suggestion 
that people may code two similar run lengths 
as one and thereby reduce the number of codes 
to be remembered. In the case of children, this 
simplification permitted them to reveal mem- 
ory capacity for several prior events. This 
finding, it should be noted, is also difficult for 
Bogartz's model which assumes à one-trial 


trace capacity of memory. 


as increasing t 


Response M ode 

The shift in theoretical emphasis from 
stimulus-sampling models to information- 
processing models points up the different con- 
ceptions of the role of the response. In the 
former context, the response had no particular 
properties and indeed if response tendencies 
reflected any nonconformities, they could not 
be handled easily. The nature of response 
traces, however, becomes relatively interesting 
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in the latter approach in which interfering 
Properties of response traces are crucial. 

In two experiments, Bogartz (1965) tested 
the linear statistical model's ability to handle 
the sequential responding behavior of very 
young children (4 years, 3 months to 5 years, 
7 months). With frequency schedules having 
T = 80 and .50, he found that children using 
manual predictive responses showed both 
strong response alternations and position 


preferences which the linear model could not 
handle. In the second st 


udy, 12 of the 16 
subjects in the first experi 


ment were used in a 
similar test except with verbal prediction re- 
sponses. Both first-order conditionals and 


other sequential statistics proved compatible 
with predictions of the linear statistical model 
in this experiment. However, in a subsequent 
reanalysis of the data of Experiment Il, 
Bogartz (1969) found that his trace model with 
two parameters could describe these data al- 
most as well as the four-parameter linear 
model, if he assumed children were using a 
repetition rule, 


Since both Kessen and Kessen (1961) and 


Offenbach (1965) reported alternation be- 
havior with young children using verbal re- 
sponses, Green and Myers (1968) compared 
the two response modes using a frequency 
Schedule (r = :60) and two age levels (mean 
ages: 4 years, 2 months and 5 years, 8 months). 
Neither €vent-alternation nor response-alterna- 
tion behavior varied às a function of response 
mode, but response alternation was found to 
decrease over the 200-trial session. These 
authors speculated that experienced subjects 
may have been responsible for Bogartz’s 
Superior fit in the second of his two experi- 
ments. Asymptotic first-order conditional sta- 
tistics did not rank order according to predic- 
tions of the linear model in either age group, 
but it is possible with a comparable number of 
trials there would be greater correspondence 
with Bogartz’ findings. 

The interfering role of ve 
in the context of 
has been studied 
sented children w 


rbal response traces 
an information-trace model 
by Bogartz (1969), He pre- 
ith two decks of cards; one 
deck was a binary single-alternation deck, the 
second had four different cards, two of which 
were identical to the members of the single- 
alternation deck. He required 4- and 5-year- 
olds to name one or three cards from the inter- 


s of 


z prediction 
polation deck after each of 65 se 
the single-alternation deck. Mean pe Jecrease 
accuracy on the alternation = ere Gen 
from the usual value of 80° to ont e 
Bogartz interpreted this as kai response 
naming response enters an interfering lies hi$ 
trace into memory and the child M d that 
alternation rule to this trace. He pi was 
the children's use of the alemana n rather 
rather resistant to change even with t T s 
large error rate, a finding that again wi 
that children either appropriately ^. e 
appropriately do not use errors as 
signals to revise Strategies. 


Transfer 


p 
i -ildren's 
Not surprisingly, investigations on pie 
ability to respond to MONS rd that 
higher-order event pendones In wi 
Sensitivity to sequential shifts incre? 
age. p 
"na and Liverant (1960) found that d 
ibut 
jdre? 
3: 


cor 

E Ge "his "s 
with reinforcement probabilities. x dif 
clusion seems to ignore any maturatl and 


(mean 
ths 


"n or : 
Shifted half of each group to m = .70 0 


aed 
os -es consist 5 
aining sequences con? Es 


With performance on th pee tO) 
quence. In general, the children’s ability. ex 
respond accurately to the more De. 
regularities of the training sequences Ez 
poor, but a temporal delay prior to E, 
in each case enhanced accuracy on the SCC ha 
Event (i.e. the first Ey). The effects of suc e 
delay are Somewhat difficult for a trace mo n- 
such as Bogartz’s. The delay may have e 
phasized the respective patterns as units 


* ation 
he single-alternatio 


pm 


2 
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the chi " 

eus thus increasing the chances that 

ise i would recall them together. In any 
; itis possible that training subjects to 2 


Criterio 
) oi : 
} n of learning prior to transfer would 


ave re : 
Ned interference effects from the 
event probabilities ( Variations of first-order 
ing trials a ities (.25, 50, .75) for 7> train- 
Performance, interfered with alternation 
ing n out training with simple repeat- 
older child FER did not. As in other studies, 
sing k zen were more accurate 1n transfer. 
cues (1 iem extreme first-order conditional 
teheaking on. .9) to define alternating an 
Compared Tone s Jones (see Footnote 4 
and 100 te -and 5 year-olds over 100 training 
aoe est trials in a factorial shift design 
AV Both age levels showed greater 
Carlier ind sequential properties than in 
Breater "s 105; and the event dependency was 
children ree sequences and for older 
epender agam, the children revealed limited 
after m5 on the transfer sequence even 
ransfer tials, with greater interference in 
alternati occurring in subjects shifted from 
Mg the eas to repeating than those expertenc- 

esr verse shift. 

Yes of ce the limitations of age ranges and 
BÉ sub MENGES studied, transfer data reveal 
Fn spi become more sensitive to 
z al shifts with age. Existing work sug- 


Ests 

se that children are most likely to respond 

Sequential properties early in training, and 
raining are more 


dig, parence shifts late in t = 
also į t. The relative nature of the shift is 
3 ine oie Children are more likely to 
Seque to alternating sequences after repeating 
Uences than the reverse shift. 
9 summarize, in random tv 


ule 
ad upport 


complex hy- 
limited event 
se persevera- 
kes their 


le 
Poth and abilities to gene" e 
e ses. Younger children show 
gern dency, and strong id E 
tendenci 4 fact W ic 2 
argi cies, a E j 
argina] responding appear optas. preci 
intermediate 2865 show mo complex 
Vrategie 4 as response alternation, and 
this oft s Mus a in lowe" P(A1) scores. Older 
R en re si us s ; 
chil n result: - sensitivity to preceding 
dren show greate c . 
E nce and evidence 


é ue 
tee in a rando? "ding 

fEative recency ipn m i 

Y weits hypothesis for non- 
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where different optimal 
solutions exist, has not been assessed. But in 
such schedules, all children reveal greater 
sensitivity to the event structure and to run 
cues than was apparent with frequency sched- 
ules. Evidence of, perhaps, rule generation is 
suggested in studies using alternating and other 
nonrandom sequences. Whether this perform- 
ance reflects children's ability to generate and 
test formal rules or whether it reflects merely 
the retention of certain dominant expectancies 
remains a fine distinction. If children do ap- 
proach a sequence with the ability to attend 
to sequential cues, the manner in which this 
ability changes with age and task complexity 
becomes important. However, an interpretation 
of the data suggests that age and complexity 


random schedules, 


may interact. 
This interaction can be examined by assum- 


ing that at least two important processes 
change with age: expectancies and cognitive- 
processing capacity. Age-specific expectancies 
become apparent in the stereotyped responding 
characteristic of any situation devoid of dis- 
criminable cues. T 


he noncontingent Success 
schedule is the mos 


t obvious example of this, 
but from a child's viewpoint other situations 
may also seem devoid of identifiable cues. The 
disproportiona 


te difficulty of frequency sched- 
ules may so tax the child's limited cognitive 
capacity that in this situation he is unable to 
identify reliable sequential cues or to revise 
initial expectancies. The fact that 5-year-olds 
do show sensitivity to run cues with simpler 
sequences (i.e., few run lengths) supports the 
notion that the complexity of frequency sched- 
ules can contri 

ies. A majority 


bute to children's apparent in- 
sensitivity to sequential properti 
of studies have used frequency schedules, and 
lisproportionately taxed child- 
essing capacity, thus demon- 
strating primarily that in the face of cognitive 
strain, the subject relies on dominant ex- 
pectancies. In a related context, Siegel (1969) 
has shown that there is an interaction of age 
and task difficulty such that the processing of 
greater amounts of information is differentially 


more difficult for younger children. 
Thus, strategies that children demonstrate 


with frequency schedules may reflect a com- 
bination of attending to salient nonsequential 
cues and a reliance on dominant expectancies 
— both of which may result from cognitive 


so may have ¢ 
ren’s limited proc 
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Strain. The role of attention, therefore, 
be more prominent in children th 
because the cognitive capacity of children is 
more limited. The dominant hypotheses or 
expectancies on which the subject focuses 
initially vary in nature and complexity with 
age and often determine characteristically 
different initial Strategies. In adults, cognitive 
Strain is initially reduced by differential weight- 
ing of sequential cues determined by these ex- 


pectancies. This takes the form of early biases 
toward alternations of ru 


recency. The ability to ident 
may be reduced by task cor d 
(1960) Teported that adults tra; 
plex schedules were less sensi 
Sequential regularities than those experiencing 
simple schedules, In children, frequency sched. 
ules may have more marked effects so that 
initial expectancies often Operate on salient 
nonsequential cues such as recent stimulus or 
response traces, In young children, response 
perseveration is congruent with a “correct 
Strategy” in many cases, while response alter- 
nation typical of 5 and 6 years is not. Both 
May persist inappropriately with frequency 
schedules as a result of children’s inability to 
cope with error information in complex cir- 
cumstances. The relatively greater prominence 
of interfering expectancies in children’s be- 
havior may be the combined result of attempts 

nitive strain and children's in. 
Children's greater 
ations of expectancies 


es that rapid invalida- 
tion of dominant tendencies is 4 relatively 


potent source of error information for children 
as well as adults. 

Significantly, in Schedules less complex 
than. frequency Schedules, some children do 
reveal focusing on sequential cues. But, ef- 
fectiveness of structural regularities in per- 
mitting children to attend to higher-order 
Sequential cues and so eliminate some ir. 
relevant hypotheses is unknown. If, as some 
have Suggested (e.g. Myers, 1970), Structural 
constraints increase cognitive complexity, 
children may be less adequate in deterministic 
Sequences than in Semirandom ones with com- 
parable numbers of run codes. However, child- 
ren’s ability to use the Sequential regularities 
in language, even at Very young ages, as well 
as their remarkable imitative abilities, would 


may 
an in adults, 


e fact 
scem to suggest the contrary. Indeed, en 
that children learn to speak before the) ts that 
the abstraction of counting See ly at- 
higher-order regularities are more = s. An €x- 
tended to than component pue t 
amination of children’s ability to P have 
recall deterministic sequences At emphasize 
significant bearing on theories that ming, a^ 
processes such as counting, rule iie 
hypothesis testing in sequential lea 


an 
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Specific item content determines response con- 
sistency. The impetus of a number of early 
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other than item content, such as response biases 
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guessing tendencies, Following a number of 
empirical studies in the personality domain, 
Jackson and Messick (1958) emphasized the 
importance of these response biases by high- 
lighting a distinction between content and sLyle. 
They concluded that the major common factors 
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nbiguous on this point. This 


fore seen to be an 
review aims to explicate these two processes, 
a the other as well as 


to differentiate one from 
from content-specific responding, and to de- 
scribe some of the experimental evidence 


bearing upon this conceptualization. 


CONTENT AND STYLE POSITIONS 
Proponents of a pure content position in the 
content style controversy would contend that 
all stable individual differences on personality 
instruments reflect the responses of individuals 
to the specific semantic content of the stimuli. 
Proponents of the style position believe that 
some stable individual differences on person- 
ality measures represent consistencies in re- 
sponse to general characteristics of the stimuli 
other than and in addition to specific item 
meaning. Social desirability, extremity, and 
acquiescence have often been considered major 
contributors to noncontent variance. It should 
be noted in passing that interpretations of 
response style have frequently included refer- 
ence to its interdependence with content 
(Jackson & Messick, 1962b), contrary to 
Rorer's special definition. 
The person who says “true” when asked 
whether or not he is "friendly" may well be 
saying “true” for a number of possible reasons. 
A strong version of the pure content position 
would hold that a person answers “true” if 
and only if he really is friendly; for example, 
because he smiles when he meets people or 
because he is courteous. The strong version of 
the content position, however, can immediately 
be seen to be unrealistic since it requires 
virtually perfect accuracy in self-perception 
and self-report. A weak version of the pure 
content position would hold that a person 
answers “true” only if he believes himself to 
be friendly. Both versions of the content 
position would require evidence for the con- 
vergent and discriminant validity for the 
particular trait in question. If a person re- 
ports that he is friendly on a personality 
questionnaire but is observed by others to 
behave in an unfriendly manner, the pure 
content position must account for this dis- 
crepasicy. TE the disctepancy is attributed to 
consistent inaccuracies in self-description or to 
idiosyncratic modes of expression, then the 
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has a relatively undifferentiate 
as revealed in a 


or because he 


d self-concept, 
general tendency to indis- 


criminantly endorse many traits as self- 
descriptive, 

The distinction between content and style in 
Personality assessment 


was highlighted by 
Jackson and Messick (1958) to emphasize the 
importance of individual response consis- 
tencies that appeared to have 4 basis, not 
in the Specific Personality variable of interest 
to the scale Constructor, but in other expressive 


Consistencies in Personality influencing the 
questionnaire response. Jackson and Messick 
(1962b) elaborated this distinction as follows: 


in defined assessment 
of broader 


€ item content used to elicit 
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to item 
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With the S ges that findings by Block 
€ rating MPT of “correspondence between 
rules ar s and the responses simultaneously 
ing int either a social desirability or 2 
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Is THERE A CONTENT-STYLE 
CONTROVERSY? 


Several publications appearing at about the 
same time a few years ago had the combined 
eflect of giving the impression that there 
really is no more content-style controversy: 
There is no controversy, because styles do 
not exist. The most influential of these pub- 
lications were Rorer’s (1965) review and 
Block's (1965) monograph. A number of item- 
reversal studies also appeared (Lichtenstein & 
Bryan, 1965; Rorer & Goldberg, 1965a, 1965b), 

istencies in response to 


showing sizable cons! 
original and reversed forms of MMPI items. 
These were interpreted as a final deathblow 
to response styles, since the subjects’ observed 
consistency was attributed to content, not 
style. Many psychologists have viewed these 
publications as the culmination of work in the 
response-style area, as well as a signal for the 
termination or extinction of the general con- 
cept of response style. (e.g Janis, Mahl, 
Kagan, & Holt, 1969; Mischel, 1968; Nun- 
nally, 1967). It is often forgotten that the 
work of Rorer and Block was rapidly chal- 
lenged in various replies. Bentler (1966), 
Campbell et al. (1967), Edwards (1967), 
Jackson (1967b, 1967c), and Jackson and 
Messick (1965) primarily focused their criti- 
cism upon Rorer's conclusions regarding per- 
sonality scales, while Peabody (1966) and 
Rundquist (1966) focused their evaluations 
upon what they considered errors in Rorer's 
conclusions regarding attitude questionnaires. 
These critiques, together with a good deal of 
recent evidence, have sustained the theoretical 
and empirical resilience of the concept of 


response styles. 


Although there are potentially contentious 


points at the theoretical level, most of the 
literature. concerned with the content-style 
issue admits to the possible existence of re- 
sponse styles but disagrees on the interpreta- 
tion of the evidence for their importance. The 
basic controversy is thus at the level of data. 


And the data most highly disputed in the 
response style literature are those of acquies- 
cence (Messick, 1967). 

Consequently, we now turn to the question 
of the existence of acquiescence. Prior to pro- 


ceeding to the empirical level, some distinc- 
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tions are proposed that have been made before 
(Jackson & Messick, 1965) and that appear 
to be useful for other investigators (Campbell 
et al., 1967). 


NATURE OF ACQUIESCENCE 
Example 


Suppose each of several subjects is asked to 
indicate what kind of a person he is by re- 
sponding to phrases, Tf he interprets the 
phrase as describing him accurately, he should 
respond “true”; if he does not consider the 
Phrase as an accurate self-description, he 
should respond “false,” The subj 
presented with four phr 
adjectives and 
"sad," “not happy,” and 
the subjects would respond 
indicated in Table 1. F 
each subject responds “true” or “false,” and 
corresponding to his responses, 

im a score of 1 for every phrase endorsed, or 
marked “true,” and a score of 0 for every 
phrase not endorsed, or marked “false.” 

These four items were chosen to represent 
& content dimension Corresponding to a posi- 
live versus negative feeling state. It is now 
possible to compute the six different correla- 
tions between Pairs of the four items, as listed 
in Table 2. If the subjects are responding con- 


sistently to the content of the items, one would 
expect the sig i 


“happy,” 
"not sad." Thus 
to four items, as 
or each of the items, 


let us give 


Table 2, Thus, for ex 
response to ‘ 


Same content dimension, In contr; 


ast, the pair 
"happy"—'not sad” should correla 


te Positively, 
hemselves as 
TABLE 1 


Sunjecr RESPONSES TO Four ITEMS 


Item number 


Item Key 
1 Happy T=1 0 
2 Sad T=1 0 
3 Not happy T ed FO 
4 Not sad T=1 F=0 


Note.—Abbreviations are: T = true; F = false. 
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TABLE 2 
ENCE 
EFFECTS OF CONTENT AND ACQUIESCE 


] iescence 
| Acquiescen 


La 
3 Correlation pair Content. . | Accep- 
E AES tance 
| 

— + 

= + 

+ 

i t 

= + 

- T 


r then’ 
wou 
A rather 


"happy" would presumably also isum 
selves “not Sad," while some person 
respond “false” to both phrases. its ow? 
different set of item responses, with ould De 
consequent pattern of correlations, "ding 
Observed if the subjects are id^ is d 
noncontent attributes of the items. cerns 
acquiescence response style that con 

here, 


Two Types of Acquiescence di 
The first type of noncontent ee a style 
agreement acquiescence, This respons in the 
can be defined as individual diference aul 
tendency to agree. In the personality 1 with 
the definition would include peg, d- 
personality statements. Thus a person riesce ce 
ing solely in terms of agreement and eri a 
would respond “true” to all four fimensio? 
Person at the negative pole of this por 
would respond “false” to all four eniti 
only agreement acquiescence were nieke r that 
responses to the items, then it is a 
all items would be positively correlate 
the ones Opposite 2 . Agre 
tional expectation is presented in the : 
ment column of Table 2. Tt can be bn from 
the pattern of Correlations to be ee aie rom 
content responding is radically differen scent? 
that expected from agreement acquie to 
alone. The Word “agreement” is use (1960) 
Consistent with Couch and Keniston 
and with Messick (1967). sé 
The second type of noncontent yespor E 
that of acceptance acquiescence. This resp ce 
Style can be defined as individual dine a 
in the tendency to consider characteris hi$ 
€scriptive. In the personality qomain, 


of 


js 


— — ~~ ws . 
EEE EEE 
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ition would include accepting character- 
Ee self-descriptive. Thus an extreme ac- 
T would say "true" to all items that 
ES personality characteristics and would 
Acteristic: to all items that deny such ie 
in the ics; an extreme rejecter would repon 
all ch ppoe fashion, disclaiming any um 
say [mas de tt Thus an accepter ho d 
to « rue” to “happy” and “sad,” but " false 
would happy” and “not sad.” The rejecter 
Sunc. the reverse. If only acceptance 
the it SCEUUE ‘WEEE determining responses to 
he mene the pattern of correlations Am 
uen would be quite different from those 
ation usly presented. The only positive pos 
ial would be among the two pairs E 
trait ne uoy presence pier e ats 
"not P vas "happy" "sad and ‘not ae 
"true E Since persons would e m a 
dora. M false" to both. Positively * 

~Salively worded pairs would correlate nega- 
n of Table 2. 


tive = 
nVely as shown in the final colum 
acquiescence Was 


;.€ Concept of acceptance Eo 
& Me 1 


fir A 
e lineeted in 1965 (Jackson 


Bach 
ckeround for the Distinction 
pA Cquiescence was initially defined as à 
quency to respond “true” to test Xen 
ue Cronbach, 1942). Jackson and I 
nc oned that if acquiescence Were simply * 
an ess of "ne^ responding (agreement), 
nd ; A 
a if acquiescence accountec 
Cant porti iance on 
ag. 1t portion of varia I s € 
‘Steeing tendency should manifest ues aS 
actor separating the true-keyed anc 
Pd Subscales of various tests wr 
alysis of such devices- Jackson anc 
x 961, 1962a, 1962b) obtained almost Pi 
Verification of this notion wita o. Le 
"s three different samples. Block Ey: is ‘factor 
a argued that the existence. E 
ic 4 content In a : 
we not rulé out & Or. -e carried out 
inally, item-reversal studies were oe Guid 
ichte 1965; Rorer & Gold- 
D Ichtenstein & Biya were written that 
erg, 19654, 1965b). Items * 


igi *sad" or 
Ren he originals (as 
€ the reverse 9 the g ; 
cham be considered content op- 


t 
AGE. h S)! may 
bodies 47 a e » in our example), and the 
ts of "'hapP?* ud Tr 
Correlati betwee! original and reversed 
i PE i The investigators re- 


ltems were examine® 
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ported correlations between original and re- 
versed items and scales almost as high as 
estimates of item and scale reliabilities, a 
finding interpreted as apparently indicating 
strong content consistency, although content 
was not otherwise defined. Data such as these, 
as well as data showing in certain cases low 
intercorrelations among measures of acquies- 
cence taken from different tests and lack of 
consistent correlates of acquiescence with ex- 
ternal criteria, led Rorer (1965) to the con- 
clusion that acquiescence was a myth. This 
conclusion unfortunately left certain critical 
features of previously published data unex- 
ed, such as the perfect factorial separation 
of true- and false-keyed MMPI scales. 
Problem wilh reversal studies. Two distinct 
types of item reversals were used in the re- 
versal studies cited above. One may be called 
a polar-opposite form of content reversal, the 
other a negation form. In the example cited 
in Tables 1 and 2, the pair "happy" "sad" 
would represent polar opposites on a content 
dimension, and thus “sad” could be con- 
sidered a polar-opposite reversal of “happy.” 
In contrast, “not happy” would represent a 
negation form of content reversal, since the 
phrase does not define the opposite pole of 
the content dimension. With items scored as 
indicated in Table 1, it is quite clear that the 
correlation for both types of reversal would 
be negative in sign if content were determining 
subjects’ responses. This effect was seen, for ex- 
ample, in the first two rows of Table 2. It is also 
obvious, as shown in Table 2, that the expected 
correlation from the agreement hypothesis is 
opposite in sign. Since reversal studies reported 
a reasonably high negative correlation for both 
types of item reversal, it was clear that agree- 
ment acquiescence must be a relatively minor 
determiner of responses on inventories such as 
the MMPI. 
However, there remained a major problem. 
The high negative correlations observed in 
reversal studies would be equally consistent 
with an acceptance hypothesis as with a con- 
tent hypothesis if negation reversals were 
used in these studies; this effect is illustrated 
in row 2 of Table 2, for example. An examina- 
tion of the various reversal studies made it 
obvious that a large proportion of their re- 


versed items were negation reversals, rather 


plain 


192 P. M. BENTLER, D. N. 


than polar-opposite reversals. For example, 
over 100 of Rorer’s Were reversed by prefix 
the phrase “Tt 
the existence of strong “content” Correlations 
could provide equivalently good verification 
for acceptance as for content. Studies which 
thus appeared to 
have a flaw, These issues were reported quite 
Some time ago (Jackson & Messick, 1965), 
but to date only a few others have taken y 


et al., 1967). 


Ubiquitous MMPI. A number of 
about the MMPI remai 


content Proponents, 


tors to reflect different 
(e.g., Block, 1965; Wiggins, 1966). Thus Rorer 


(1965), for example, concluded that “i 


it is an 
arbitrary decision to label the factors in terms 


ases, or styles, rather than 
In terms of the criterion £roups which the 
MMPI scales Were originally devised to dis- 
criminate [p, 142]? But what could be the 
nature of this Content, which despile the 
different content names given it (e.g., depres- 
sion, Schizophrenia), appeared to implicate 
virtually all MMPI items and scales in only 
two major common dimensions? Clearly the 
two major MMPI factors represent such a 
Seneral kind of “content” as to be something 
quite different from the content assumed for 
it by the Constructors of the MMPI. The 
definition of content as implied in the use of 
different construct. or trait names for the 


aspects of content 


uire some convergent 
and discriminant properties, a property in the 
PI notable for its rarity. One of the 
require every true. 
keyed subscale to measure something that js 
different from and opposite to what every 
measures. Furthermore, 

the same trait would have to be measured by 
all true- and false-keyed subscales. A content 
require an explanation 
for this single dimension in which the true- 
keyed scales when keyed positively for con- 
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zeyed 
tent measured one pole, and the fale 
scales when similarly keyed. ae 
Content measured the opposite " ® pility 
would be analogous to a true-fa se for à 
test in which all items keyed € “false” 
correct response and all items keye factor in 
for a correct response load the bnt, general 
Opposite directions, A Hypothesis correct- 
ability factor would ordinarily havea conl 
keyed items loading in the same ae as those 

Subportions of MMPI scales, such 2 


ú items; 
identified by true- and filiacion highly 
would seem of necessity to correla fre- 


in the content direction; yet this ber eat 
quently not the case, Indeed, it in tota 
that subscales added together to oap a hig 
score would possess components ped y 
negative relationships, a fact also rev p E 
the negative relationships between ; (Jackso? 
of true- and false-keyed MMPI scales lifferen 
& Messick, 1961). Consequently, the e ac 
content scales did not provide their o 
tors in factor analysis, sates By 
Another MMPI factor also implicate orta 
scale, seemingly in terms of an ui A. t 
Connotative stylistic attribute of respo bility? 
Personality items, namely, their TT his 
The Correlation of an MMPI scale V ledge 
factor is easily predictable from know ° items: 
the social desirability scale values of its ed bY 
Yet this relation must also be espiat 
content, In addition, MMPI scales are in this 
ized in an interesting circular pattern n the 
two-factor space, with the loading © les 


ery 


-eved sC& 
factor Separating true- and false-key ed come 
Systematically decreasing as scales i 


i" 
satio! 
: . ays H rcanizat^", 
more extreme in desirability, This orgat 


mi” 
of subscales within a factor structure ee 
Y two large factors is not we 
with any published account of the rac 
on which scale Construction for the hec 
MMPI was based, nor have we pem e 
any theorist who could provide a me 
vontent interpretation for this circular PIM 
Block’s (1965) reconsideration of the M osi 
representing very broad a on 
appear to explain this ee 
tructure of the MMPI. It ter- 
not a relevant rejoinder to this stylistic 1n t 
Pretation of the MMPI internal Sree i 
demonstrate that there are Possible me 
Correlates of these two large dimensions, 


[: 
pe noted earlier, such a finding would 
tion ( pconsistent with a stylistic interpreta- 
1 Jackson & Messick, 1958). 
Bons mk. definition of content 
tinct me t Fiske, 1959) requires that dis- 
ighly hep of separate traits be not so 
cal and adis ated as to preclude their statisti- 
Scales false we oh independence. Personality 
tion in ter a show appropriate discrimina- 
be expect we of internal structure can hardly 
constructs, to be associated with independent 
tepresentin, Neither can they be dignified as 
mately | ERE independent content nor legiti- 
Fac: be referred to by different names. 
of dic weer of Rorer's reversed form 
E en ue (Jackson & Messick, 1965), 
again "s separately for males and females, 
Mea 15d the existence of the two major 
ou my including the true-false factor. Even 
e gn the scales were reversed in direction, 
iis factorial separation of true- and 
Mex scales existed. The loadings of 
TA scales on one of the two major 
e fal ions were opposite in sign to those of 
Actor Se-keyed scales. The usual type of two- 
s e taking account of appropriate 
1958) . reversals, resulted when a Tucker 
laken Operis ^ factor analysis was under- 
Bether. ^ the original and reversed forms to- 
e n 1 his analysis, which focuses only upon 
Verse Y ow in common to original and re- 
avoj d forms, was undertaken to completely 
id the problem of item overlap. The results 
nd analyses were essentially identical to 
Wo S cunei reported ; indeed, the first 
accou actors were more massive than ever, 
unting for 94% of the total variance. 
Seite together with the previously pre- 
quit information that the reversal data were 
jte amenable to a style interpretation, this 
erbattery analysis is also consistent with 
B Style position. The existence of only two 
* factors to account for the correlations 
count fo original and reversed scales—to ed 
Consis or virtually all of the observed -€-—— 
mis d between the two fous wo 
Cont to violate reasonable requirements DE a 
(od position that attributes distinct con- 
to different scales. 
fr In the remainder of the review we turn to 
€sh data to evaluate the two acquiescence 
Styles described in this section. It is appro- 
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priate to present a few highly pertinent studies 
in some detail, in order to avoid a superficial 


treatment of masses of data. 


RECENT EVIDENCE FOR ACQUIESCENCE 


Existence of Noncontent Responding 


The types of endorsement patterns to be 
expected for phrases and their polar opposites 
described previously, and 


or reversals were 
the two types of 


the effects of content and 
hypothesized acquiescence were summarized 
in Table 2. The types of correlations to be 
expected from a content position as well as 
from the two acquiescence positions were de- 
scribed there for each of the various possible 
kinds of item comparisons. The most definitive 
contrast to be expected from a content and a 
noncontent position can be seen to be in the 
comparison of polar-opposite terms. A con- 
tent position would indicate that polar op- 
posites should correlate highly negatively, 
while both the agreement- and acceptance- 
acquiescence positions would demand positive 
correlations among polar opposites. Thus a 


study dealing with polar opposites presented 
items of Table 1—for 


in the fashion of the i 

instance, adjectives—would yield quite con- 
trasting results from the content and style 
positions. Such a study could not disentangle 
the two types of acquiescence, but it could 
demonstrate the existence of acquiescence per 


se. 

The correlation 
in Table 2 hold only 
content or acquiesce 


al expectations summarized 
for the ideal case of either 
nce responding. If both 


content and acquiescence responding influenced 
the results as components of variance, the 
expected correlation between polar opposites 
would be some compromise between negative 
ssociated with content and posi- 
sulting from acquiescence. 
The correlation might even be zero! A zero 
correlation by itself would be difficult to 
interpret, however, since it could be due either 
to error of measurement or to SOME compromise 
of the two response dispositions. As a result, 
it was necessary to design a somewhat more 
complicated study than the simple and obvious 
one. 

The most agreed-upon expectations regard- 
ing the content correlation of a set of scales 


correlation à 
tive correlation re 
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could be derived from the widely cited mea- 
Surement work of Osgood, Suci, and Tannen- 
baum (1957). These authors established that 
evaluation, activity, and potency 
Connotative dimensions in an ind 
scription of common objects. Be 
hypothesized that ac 
could be described cle 
Connotative meaning. 
of the polar-opposit 
“bad”) to assess each e 
tic differential factors. 
tives were collected : positive evaluation (E +), 
negative evaluation (E —) 

(A +), negative activity ( 
tency (P+), and ne 
An adjective checklis 


define major 
ividual's de- 


person’s concept score 
dimension was the 
checked. One hundred concepts were rated by 
200 undergraduates, From these data, a scale 
of 25 adjectives was selected by monotonicity 
analysis (Bentler, 1970) to assess each of the 
Six poles mentioned above, The number of 
adjectives (out of 25) checked by a person for 
& concept on one of the six scales represented 
the Concept score for a given person. These 
total scores formed a 200 X 6, Person X Scale, 
data matrix, The six scales were correlated, 
yielding a set of relationships among each of 
the scales, 


What type of results would one expect if 
only the Specific semantic content determined 
a scale's tota] Score—that is, what should the 
Correlation matrix look like if paper-and- 
Pencil tests are unaffected by any type of 
acquiescence? Obviously, if two scales for a 
given semantic factor measure opposite ends 
of the same factor, these opposites should be 
highly negatively correlated, Thus, E + and 
E — should be highly negatively related, as 
Should A+ and A —, and P + and P —, 
Cross-factor correlations should be reasonably 
low—certainly lower in absolute value than 
the absolute value of the correlations of polar 
opposites, Alternatively, if either agreement 
or acceptance acquiescence is a major de- 
terminer of scale Scores, polar opposites would 
tend to be less than perfectly correlated since 
stylistic and content response tendencies would 
be working in Opposition, Acquiescence as a 
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by 
se sed b, 
Reis em e asses 

stylistic characteristic would be a across 


arate 
E separa 
all six scales. It is not possible to 


at 

s sre, so tha 

the two types of acquiescence em of both 
this score represents the operatio yet the 


processes in unknown cambia orthog- 
well-established bipolarity and re z cales wer® 
onality of the factors on which the bility o 
based would largely exclude the iym acquie* 
a subject obtaining an extremely t considera- 
cence score on the basis of conten determinee 
tions alone. If only epe días 
responses, all scales, even mee posi y 
opposites, would be significan "affected. 1? 
related. If specific content also sites woul 
sponses, the correlation of polar T these tW? 
represent some compromise bewe E 
opposing correlational tendencies. cales in the 
The correlations among the six dy are pre 
adjective semantic differential aes am ng 
sented in Table 3. The ge non close 
polar opposites are in all three raed all ^ 
zero, but positive in sign. Further ~on 


itude i 
area are larger in absolute magie. op 
the three correlations representing Pe relatio? 
posites of the same scale. Thus, mee r 
between two subscales, A F anua " 
senting different content, is rela two sca 
(.65), while the correlation between gait; 2 
designed to measure the same w a dire’ 
and E —, is low and positive i n the 
tion opposite to that expected pe conte : 
pothesis of consistent responses s supp?" 
Thus, these results provide Á— are DO 
for an acquiescence hypothesis ane ontent n 
consistent with the hypothesis that co nant ° 
the sole or even the major determi! 
responses to descriptive adjectives. aca! 

Even better evidence favoring Een j 
cence hypothesis can be obtained ; cequies? 
interrelations among the scales. If od the 
cence, as well as content, accounts artial” 
Pattern of Correlations in Table 3, then Pi the 
Ing out the effects of acquiescence prem 
correlations should yield a pattern of re ‘pase? 
Ships which Corresponds to expectations as? 
on Content-determineq responding. bs Ui Í 
Measure of acquiescence the tota] numbe 
adjectives che 
incidentally, y 


ef 


ries" 
he 


cf 
hic 

Cked across all six scales a 

ielded a Coefficient alpha inte? 


add 
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4 TABLE 3 TABLE 4 
EMANTIC ADJECTIVE DIFFERENTIAL CORRELATIONS SEMANTIC ADJECTIVE DIFFERENTIAL CORREL. 
RES ATIONS, 
Adjective wells " | : Š ACQUIESCENCE PARTIALEÐD OuT j 
sel E E- £ &-— "E = 
Adjectiv | 
| jective | Ep | E- | At | Am | py | pH 
Et 1.00 ! 
a 05. 100 E+ 1.00 
AT .65 .22 1.00 B= —.76 1.00 
Py 23 .63 .08 1.00 A+ 33 = 0 3:00 
P- 30 57 59 35 1.00 A— —.40 33 —.77 1.00 
50 39 32 64 18 1.00 + —.36 AS 15 —.28 1.00 
P 06 —.16 —.38 32. —.70 1.00 


ian of .79), partial correlations were 
the eff ion among the scales holding constant 
Dress ects of acquiescence. These results are 
p in Table 4. In contrast to the results 
egi Table 3, the pattern of correla- 
Would. Table 4 corresponds to what one 
a Speck on the basis of content responding 
a opposites correlate highly negatively, 
dessins absolute values of all three polar- 
apes correlations are larger In magnitude 
Com a 12 cross-scale correlations. A more 
tutes lete verification of the hypothesized ac- 

cence tendency could hardly have been 


expected, 
An additional study? verified the applica- 
process demon- 


ad of the acquiescence n 
of ed for semantic meaning to the domain 
M e pi assessment. A total of 193 
Wee ers of children in the age range 0-6 years 
ow asked to describe their children usingthe 

jectives of the adjective semantic differ- 


enti; 
Ntial, Total scores for each of the two poles 
obtained 


ae three semantic factors were ODT ils 
ty ey onan in a fashion quite ng 
werd described in the previous study- de 
(ma left-hand portion of Table 5 presents 
On Correlations among the var 
io ps again, one finds correlations 
i. mn between polar opposites, as well as 
airly consistent positive correlations among 
oe from different content domains. A 
bee adjective acquiescence score was 09- 
3 ined by counting UP the total number of 
Mijectives checked by a mother in describing 
er child. The intercorrelations among the 
Various subscales, Which result when the adjec- 


rious subscales. 
very close 


? Bentler, P. M- Children's personalities assessed by 
an adjective check list. Unpublished manuscript, 
niversity of alifornia, Los Angeles, 1969. 


tive acquiescence scores are partialed out, are 
presented in the upper right-hand portion of 
Table 5. It can be seen that the partial correla- 
tions once again correspond much more ac- 
curately in size and sign to what would be 
expected on the basis of content responding. 

A final study bearing on the presence of 
emerged as a by-product of an 
hich sought to evaluate empiri- 
ive methods in multivariate 
Boyd and Jackson (1969) 
echnique of multidimensional 
als scaling to a set of 18 
] to reflect bipolar 
Rejection, Power 
ition. In addition, 


acquiescence 
investigation W 
cally two alternat 
attitude scaling. 
first applied the t 
successive interv 
attitude items hypothesizec 
dimensions of Authoritarian 
and Toughness, and Superst 
they employed a procedure in which behavior 
descriptions of imaginary persons designed to 
be high and low on each of these dimensions 
were incorporated into the scaling procedure. 
Multidimensional scaling resulted in the find- 
ing that both items and hypothetical persons 
were consistently represented as predicted on 
each of the three dimensions, without any 
TABLE 5 
PTION CORRELATIONS 


Cup ADJECTIVE DESCRI 
AND PARTIAL CORRELATIONS 
| 

‘adjective | p+ | E— ae (ac | BE | P- 
set | 
E —. =—04 dO = 
E+ ay 46 02 
E-— —48 3 —.29 30 d; —.07 
A+ uo g —53 40 —45 
A— AS .44 —.09 —.29 43 
PF 44 At S4 42 — 6] 
P- .25 45 —.01 28 —0 
ing Canela Don ore in the lower left, partial correlations 


are ‘in the upper right. 
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TABLE 6 
DESCRIPTION OF SCALES USED IN ANALYSES 
_ 
Content scales 
Label Content Format Wording Keying 

ESPT Self-descriptive Positive True 

ESPF Self-descriptive Positive False 

ESNT Self-descriptive Negative True 

ESNF Exhibition Self-descriptive Negative False 

EAPT | Exhibition Attitude Positive True 

EAPF Exhibition Attitude Positive False 

EANT Exhibition Attitude Negative True 

EANF Exhibition Attitude Negative False 

PSPT Play Self-descriptive Positive True 

PSPF | Play Self-descriptive Positive False 

PSNT Play Self-descriptive Negative True 

PSNF Play Self-descriptive Negative False 

PAPT Play Attitude Positive True 

PAPF Play Attitude Positive False 

PANT Play Attitude Negative True 

PANF | Play Attitude Negative False 

SSPT Succorance Self-descriptive Positive True 

SSPF Succorance Self-descriptive Positive False 

SSNT Succorance Self-descriptive Negative True 

SSNF Succorance Self-descriptive Negative False 

SAPT Succorance Attitude Positive True 

SAPF Succorance Attitude Positive False 

SANT Succorance Attitude Negative True 

SANF Succorance Attitude Negative False 

USPT Understanding Self-descriptive Positive True 

USPF Understanding | Self. -descriptive Positive False 

USNT Understanding Self-descriptive Negative True 

USNF Understanding Self-descriptive Negative False 

UAPT Understanding | Attitude Positive True 

UAPF | Understanding | Attitude Positive False 

UANT Understanding | Attitude Negative True 

UANF Understanding | Attitude Negative False 

Response-style marker scale 
Label Content Source of items Wording Keying eee 

HMPT Heterogeneous | MMPI Positive True Jt 
HMPF Heterogeneous | MMPI Positive False 05 
HMNT Heterogeneous | MMPI Negative True 45 
HMNF Heterogeneous | MMPI Negative False 52 
HPPT Heterogeneous | PRF Positive True Ad 
HPPF Heterogeneous | PRF Positive False 33 
HPNT Heterogeneous | PRE Negative True 31 
HPNF Heterogeneous | PRF Negative False 25 
FRT F Scale Relative True 26 
FRE F Scale Relative False 25 
FAT F Scale Absolute True 50 
FAF F Scale Absolute False af 
DA Desirability Scale (PRF Form A) Balanced 55 
DB Desirability Scale (PRF Form B) Balaci, .61 
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TABLE 6—(Continued) 
Adjective checklist and other variables 
Label Bs 

AD Description | KR-20* 
AN Desirable adjectives 
AU Neutral adjectives | 
INFR Undesirable adjectives | 
SEX Infrequency Scale | 42 

Sex of subject | 


* KR- 
20 refers to Kuder Richardson Formu 


als. This indi- 


excepti 
ptions and with no revers 
the items sub- 


Cat ; 
E judge agreed that 
of the E defined appropriate poles of each 
Same ea dimensions. However, when these 
ormat E administered in a response 
equivocal. $ factor analyzed, results were 
of massi d primarily because of the intrusion 
and ne M levels of acquiescence. The positive 
to Slee items were all scored positively 
ing th n a total-agree score, essentially cancel- 
Correlati effects of content. When biserial 
and dee were obtained between each item 
of the A ee number of agree responses, 
signifi - Bose correlations were positive, 
Were DEB eden at the .05 level. If responses 
hese è solely on the basis of content, à 
around 7 correlations should have hovered 
format zero. Acquiescence bias in a response 
items a thus be elicited in à set of attitude 
Scalin hat have been shown in a judgmental 
a set d to be associated uniquely with 
Presenc clear, bipolar content dimensions; its 
the p ce produces a consequent attenuation in 
reliability and factor clarity of responses- 
Ry; 
“tistence of Two Processes 
rovide clear-cut 
acquiescence, 
the 


m studies cited above P 
ut ue for the operation of 
BéDarapi provide no data relevant t° © 

di ility of the two types of acquiescence. 
os eroe completed by Morf, and de- 
quite by Morf and Jackson (in pr 
acqui clearly the operation O e 
a B a processes. Morf an 
Caref tman (1958) facet design W! 

efully selected to elicit hypothesized con- 
alysis of two response sets: True 


endorsement. Unpublished doc- 
f Western Ontario, 1 


1M 
Tes Mort, M. E. An am 
wsPonding and itemi ersi 
Tal dissertation, University © 


la 20 internal consistency 


reliability. 


tent and stylistic processes, specially con- 
structed response-style marker scales, factor 
analysis, and an objective rotation procedure 
that would yield an internal analysis of the 
structure of the items which would avoid 
many of the problems inherent in the use of 
the MMPI. This type of design allows content 
to appear under a variety of types of item 

but also allows response styles to 


formats, 
appear in the context of a variety of content 


dimensions if response styles exist. None of 
these properties can safely be assumed to be 
independent or systematically varying on the 
MMPI, where even the identification of con- 


tent poses some problem. 

A two-part questionnaire con 
true-false personality items and an a 
checklist were administered to 7 


108 female university student vo) 
items were Se 


two sessions. These 1 lected ac- 
cording to very exacting criteria so as tO 
provide measures of eight possible response 
processes that might be operating: agreement 
acceptance acquiescence, desira- 

ive acquiescence, and four con- 
Exhibition, Play, Succorance, 

It was decided to treat 
as potentially distinct 


lunteers in 


acquiescence, 
bility, adject 
tent dimensions ^ 
and Understanding. 


adjective acquiescence a 
since it is clear from the analysis presented 


above that adjective acquiescence could repre- 
sent either agreement OF acceptance acquies- 
~e, or some f the two. Tn addition, 


of a distinct method factor. 

created for each of the four 
-ensions according to a very res 
First, items were of two broad 
statements and attitude 
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4 
5 


Statements. Personality statements were de- 
signed as self-descriptive first-person state- 
ments, while attitude statements were written 
to elicit an evaluative response toward an 
external referent, Second, for each of the two 
types of items, the statements were worded 
Positively or negatively, in accord with the 
acceptance acquiescence hypothesis described 
Previously. Third, for each type of wording, 
items were written so as to be true- or false- 
keyed in the content direction. Thus each 
content dimension was represente 
scales. Each of the ej 
composed of from fiv, 
tion to the item at 
each of the items w. 


ave 
Intermediate 


frequencies, low 
Y, and moderate 


this design are 
ble 6, together 
ernal consistencies, 

A number of response-style marker scales 
were also included in the analysis. Four hetero- 
Seneous 30-item acquiescence scales were con- 
structed with MMPI items neutral in desira- 
bility and intermediate in endorsement. fre- 
quency. As before, these scales were worded 
positively or negatively and keyed true or 


false. A similar Set of four Personality Re- 


Search. Form (PRF) acquiescence scales was 
Constructed from the original, prepublication 
Pool of items of the PRF (Jackson, 19672). 
The Clayton and Jackson (1961) absolutely 
worded and relatively worded true- and false- 
keyed attitude scales based on the California 
F Scale also generated four acquiescence 
scales, Finally, the Desirability scales from 
the PRE were also included in the analysis, 
The 14 response-style marker variables just 
described are listed in the second section of 
Table 6, which also lists three adjective check- 
list variables. The adjective checklist con- 
sisted of 45 desirable, 90 neutral, and 45 
undesirable adjectives selected from Ander- 
son’s (1968) list; they were arranged in ran- 
dom order. Separate scores were obtained to 
yield the number of desirable, neutral, and 
undesirable adjectives Checked as self-descrip- 
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" e 
tive. Finally, as shown in Table 6, there Ms 
two irrelevant variables, an Infrequency 5 
and the sex of the subject. «all. The 

Thus, there were 51 variables in a wate 
intercorrelations among the 51 variables were 
computed; eight principal axes factors be in 
extracted from the correlation matrix = pro- 
agreement with the number of im the 
cesses hypothesized to be operating, a7 pre- 
initial factor matrix was rotated in z the 
liminary analysis by Varimax, and eile 
final reported solution by Clustran on at 
1971) to the factor position hypothesiz wert 
the start of the study. The eight anor T a 
correlated, so Bentler’s (1968) factor C repre" 
tion matrix was computed. This esp is 
Sents the orthogonal factor solution plique 
closest in the least-squares sense to the 0 ntage 
solution and permits assessing the perce 
of variance attributable to each factor. [ the 

Before moving on to a description a er. 
factors, a procedural word Seems in ces 
Human intervention in the analysis P dures 
is kept at a minimum by the proce- xi 
utilized. Once items were constructed io jons 
51 variables, no additional subjective id i 
remained to bias the outcome peyor tre 
"bias" introduced in hypothesizing aan 
Sponse processes. The subjects ut ctor 
the correlation matrix; the number of e o 
was ordained, the hypothesized positi Jeter- 
variables in the space automatically i the 
mined the multidimensional position ‘plique 
final factors, the correlation among the 0 ined 
factors was also automatically ete ate 
without intervention. As a check on the 5 re- 
bility that Morf and Jackson’s (in pre 
porting of results based on the Lupa ar 
tion, which involved hypothesizing a partic ð 
Structure, might have introduced a goer 
bias, these results were compared with c 
based on a normalized Varimax rotation, (see 
With the results as reported by Morf :nor 
Footnote 4); the original data contained o 
inaccuracies in item keying. There was à kc 
high degree of agreement between these 20 ry 
tions, The interpretation of the eight prima 
factors was identical, x 

he Orthogonal factor-loading matrix repe 
Senting the factor contributions i 


s presente 
m Table 7. Like any factor 


"loading mati 


ry 


p 
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TABLE 7 
ROTATED FACTOR MATRIX 
Factor 
Variable 
1 | n | HI | IV | v | VI | VII | VIII | ht 
23 3;  -16  —06 55 10 o4  —10 46 
-13  -17 i5 —2l 54 —02 Ho 145 44 
12 —08 13. -04 37 08 4 -H 40 
—25 21 08 06 47 28 09 09 43 
19 g Aie oH 4 2 d =09 29 
—33 —il 04 05 40 05 26 07 36 
al 08 9 —01 25 | —06 oo — —0i 19 
—25 09 08 10 31 o7  —01 11 20 
22 14 14 24 15 a 13 03 35 
—25 —09 —02 —06 12 60 08 —20 50 
a =0 6 13 09 63 o  -—08 54 
—26 26  —M 02 17 42 07 00 36 
21 22 07 02 05 47 i3 — —08 34 
cmo 1$ 0738 02 02 43 —06 —09 35 
2 =i R 10 00 5  -08 —09 6 
dj 12 -01 12 12 45 13 02 3 
04 16 27 11 08 15 55 —17 47 
—31  —09 06 06 14 —07 52 01 41 
11 OF 05 06 17 04 55 P 36 
—08 2 -2 00 04 18 51 2 
31 17 —12 13 09 04 39 —13 E 
—40 15 18 02 9-13 - S n 
15 —11 10 —02 —06 15 
—20 os 0 M —05 09 49 —08 31 
12 31 02 09 u = 07 54 B. 
-22 o5  -06  -—15 01 04 P 9 s 
12 -2 00 09 ob m6 pi a 
—33 20 10 — —06 06 05  -19 i E 
30 13 ie 0? ci mes —07 9 7 
=a  -—12 08 S NET E 15 
Los  —06 = o7 04 17 13 
20 — —18 05 
8 -2 0 —10 00 46 48 
—40 11 Í 
5 —24 20 13 00 08 05 52 
34 53 2: 6 
4 ay Ol} 0k —05 —09 65 
—30 —52 9 "m 
4 —07 06 0 =Z 00 
a 30 Er —06 48 
5 -24 05 08 10 23 0 
E a 22 12 jj —0 O08 09 : 
zd - ia 0 MEUS 
a ow cA i s de = c» 37 
a -5 7» o 00 16 02 10 33 
E se ts 8 2 05 og — —38 40 
A m pa i —03 12 06  —2 24 
E e E S 22 00 Q —35 55 
Ea 2D UME qo ace 16 — —10 30 
—39 17 -04 —28 E E —05 11 48 
—10  -10 65 oe M CO 5 s E, 
NE A 68 >i si o 20 v E 
05 11 56 Z ou i a -0 P. 
10 26 -13 2 — 0l 06 13 —09 56 
05 Ao cM a 1s g =u =ü J 
uuo = =F o -24 09 34 13 22 
S —05 06 -13 7 EN 245 2.52 2.58 2.80 
pum of squares 3.74 2.48 2.70 
ercentage of we s 23 12618. 
pee variance 18.2 12.1 13.2 7.5 10.5 6 i 
he least squares orthogonal fit to the hypothesis 


> a This T : “iding t 
Note.—Decimals omitted. This matrix is the Clustran matrix providing 
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its entries represent the correlation of variables 
with factors. We deal first with the Stylistic 
factors, later with the content factors, 


1. The first factor is clearly an agreement 
acquiescence factor. All true-keyed | scales 
loaded positively on the factor, and all false- 
keyed scales loaded negatively. This Separa- 
tion of true- and false-key 


; Tather than 
indicating the apparently 


sonality domain. 
2. Positive loadings on the 
of two types: those that reflect 


item and those 
a negatively worded item. 
t, and they represent the 
unequivocal Operation of acceptance acquies- 
cence. It js particularly noteworthy that of 
the 44 personality and attitude s 


3. This factor is a rather clear-cut desira- 
bility factor. The highest positive loadings 
Were obtained for the parallel PRF Desira- 
bility scales (.68, .65), and the highest nega- 
tive loading was obtained for the number of 
undesirable adjectives endorsed (— .51). Thus 
the existence of this readily identifiable factor 
casts doubt on Block’s (1965) contention that 
desirability responding can be explained in 
terms of consistent responding to pathological 
content, since the variables just mentioned 


were composed of items from nonpathological 
content domains, 


4. The neutral, undesirable, and desirable 
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+ net IoadiDES 
adjective scales obtained the highest tel 
on this factor (.68, .47, .43), with te ctv 
loading being obtained by neutral fable as 
Thus, this factor is clearly identi 
adjective acquiescence. jt the 

5-8. The next four factors ane 
content factors Exhibition, Play, ob- 
and Understanding. The high loac m de- 
tained for the various content S easilY 
signed to assess each of the factors f Table 
be seen in the appropriate column o „fication 
The first letter of the variable’s identi riables 
Code indicates its content; thus the E d I, 
load highly on Factor V, P variables t thes? 
and so on. It should be pointed out em even 
Content factors are clearly pen d 
though only relatively short scales bene 
of items with moderate content sa clusio® 
were used—too low to permit their e con 
in the final PRF— suggesting that rr he 
tent is truly identifiable as such, Euge. if 
context of an investigation designed Mac its 
and label response styles, it will de z he 
own factors in spite of the operation 
response styles, onse 

Thus it has been possible to identify resP b- 
styles and content unambiguously Ue is 
jective multivariate procedures. Althous ot t 
not necessary to describe this study fur of the 
meet the objectives of this SEHE. tion ? 
review, namely, the empirical identifica” ler* 
two distinct acquiescence processes, an ‘i 
standing of the processes themselves b the 
enhanced by looking more closely " 
correlations among the oblique factors 
were obtained. bli 

The 8 X 8 correlation matrix of the ° rine 
factors was resolved into uncorrelated wer" 
cipal components, and four components "rp 
rotated using a Varimax procsdurs, 
Second-order loading matrix that resulted 
this procedure is presented in Table x 
number of interesting features can be See”, 
that table, Accep hig 
own factor (T), but this factor also has rs tin’ 
oading from adjective acquiescence, indica ^ 5 
that adjective ac 
sent 


that 


que 


from 


Tue-saying process, It W! tio? 
remembered that the adjective presenta ge 
ormat, by itself, did not allow one to de 
Which type of acquiescence might be operat! 


CONTENT AN 


Agreeme = 

ID, has ih a pede also defines a factor 
Style ^c diga! no loading from other 
Negative t and, interestingly, has a high 
tent (arem ing from the Understanding con- 
Would Seena U Thus agreement acquiescence 
2" dimension s^ be associated negativ 
ility also de & intellectual curiosity. 
the wb ines its own factor (IV). Finally; 
Second-order scales appear to merge at the 
factor aD. level into one general content 
mensions 2 with the three interpersonal di- 
nderstan Puky high positive loadings, an 

icd B ing loading negatively. . : 

Support fo ie ae analysis provides 
Pendent 2: the concept of two relatively mde- 
acceptanc quiescence processes : agreement an 
the low eu his conclusion is consistent with 
tained — (11) that had been ob- 
actors, etween the corresponding primary 


ely with 
Desira- 


M, B 
asking of Content 


Tt i 
Ton: of some importanc 
Dohtent. Styles can mask the true € 
nt. The studies described in the previous 


Wo eae 
sections have already a 


e à p 1 
O DEOR of acquiescence. While no con- 
interpretation of this correction seems at 

d be pointed out 


a ple E 
ausible to us, it shoul 
le to make a post 


ss Bi if it were possib ) 
no a interpretation of this correction, 
es caes hypothesis yet published would 
Correc ed an investigator to worry about the 

tion in the first place. In a content area 


ess i 
S clear-cut than that of the semantic 4t- 
t concerned with 


ES an investigator not ©" Hits 
relati scence would accept his content cor- 
i. at face value—y aly be tre- 
ie d misled. For example, virtually all 
both ructors of adjective © not 
Ps ui to worry about acquiescence Temm 
Iont ey have even been advised not to worry 
Prisi it. Consequently, one finds such sur- 
ing results as conten ng 
Mesumably different traits o 
S .86, and .81; these were the correlations 
oo among measures of anxiety, depres- 
ion, and hostility 20 & study with the Multiple 
Affect. Adjective Checklist (Lubin & Zucker- 


t correlations amo 
n the order of 
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TABLE 8 
ROTATED SECOND-ORDER FACTOR MATRIX 
Second-order factors 
First-order factor M um 
I | IL | Ill IV 
I Agreement 
acquiescence 24 —26 76 —05 
II Acceptance 
acquiescence 90 12 —10 14 
III Desirability 46 -V 09 —92 
IV Adjective 
acquiescence 84 13 27 06 
V Exhibition —05 77 -0 —19 
VI Play 29 69 15 00 
VII Succorance 04 68 —09 36 
vill Understanding 10 —34 —80 —15 
loadings are in 


Note.—Decimals are omitted. Salient 


boldface. 


man, 1967). As we attempted to detail before, 
we can see no legitimate reason to dignify 
“content” traits with names implying inde- 
pendent constructs when the trait correlations 


are as high as this, and particularly when an 
acquiescence hypot vould lead one lo ex- 


hesis w 
pect distortion in a direction of high positive 
correlations. 

The Morf and Jackson (in press) study de- 
scribed in the previous section allowed for the 


identification of content, despite the operation 
clear-cut results like this 


of response styles; 
have only rarely been published (e.g Messick, 
1962). But what if content and style had been 
perfectly confounded, making it impossible to 
decide whether an observed factor was that of 
t or style? And for that matter, what if 
d style were 50 confounded that the 
content factors so highly correlated 
that they merge other? Clearly, 
n conceive where response 
ke the identification of content diffi- 
cult because of the greater preponderance of 
. tic variance. For example, in a study by 
(1967), the emergence of 
tor was found to be 
th the speed of presen- 
personality items. When items were 
resented under relatively speeded conditions, 
the largest factor obtained indicated an almost 
complete separation of true- and false-keyed 
hile under relatively unspeeded con- 


scales, V £ 
ditions, the acquiescence factor was only the 


LUL P. 


Sixth larg 
Similarly, 


well-determined in the less speeded condition, 


Jackson 


attributable to response style. One 
dominantly associated with the familiar ten- 
dency to endorse desira 
i econd factor was associated 
to endorse many or few 
ble in the present context 


MMPIand California Psychological Inventory 


all keyed true, and negative 
MMPI and 


content-saturated 
is consistent with the 

and Jackson (1967), 
Which indicates that a trading relation exists 
between acquiescence to Personality items and 
Content saturation, When content saturation 


; and vice versa, 
One conclusion relevant to scale construction 


This finding 
evidence from Trott 


final study serves to illustrate the po- 
tentially devastating effect that acquiescence 
may have in personality assessment by focus- 
ing directly upon an issue which is often of 
great interest: validity. Can acquiescence mask 
the existence of validity for personality items 
or scales? Bentler and Marshall’ asked each 
member of 4 Pair of friends to describe their 
own personality and to describe their friend’s 
personality under one of two conditions. In 
the single-adjective condition, subjects de- 


* Bentler, P. M., and Marshall, M. Effect of presen- 
tation format on item validity. Unpublished manu- 
script, University of California, Los Angeles, 1969. 
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“ing 
scribed themselves or their friend by be 
“true” or “false” to 300 adjectives ae 
to assess the three semantic differential ve n 
sions as well as other personality prim de- 
the polar-opposite condition, qe oosing 
Scribed themselves or their friend by p pair 
one of two members of a cales ee lis 
of adjectives. The polar-opposite adjec single" 
was identical, item by item, with m was 
adjective list, save that a polar Moins had 
added to each adjective. Thus the su d Sé y 
to choose one member of two ur than 
worded, polar-opposite adjectives, rath o was 
choose whether or not an adjectiv 
descriptive. sa 

Tiubisets of 100 pairs of subjects pu 
in the Study, 100 pairs in each sno item- 
Then it was possible to calculate 3 Jtem 
validity coefficients in each sandon i 
validity was represented by the peer 
between self and friend description, ca id indi- 
across subjects, A high correlation wou erso” 
cate, for example, that whenever & ra e 
described himself "active," his friend es a- 
to describe him the same way. A zero - was 
tion would indicate that, for this item, en 
not possible to predict how a friend MET. 
describe a person from knowing the De ient 
self-description. The 300 validity ane he 
for each of the two conditions fepe 
basic data of interest. Is there greater va up 
on the average, using one presentation fc 
or the other? ; :ouslYs 

From the discussion presented pees 
it is obvious that the single-adjective for oth 
lends itself readily to the operation of d 
agreement and acceptance power 
these types of acquiescence detract lidity 
validity, as could be expected, the vali 
coefficients would be lower than they poe 
otherwise be without the operation of bo e 
cence. The polar-opposite format, in cont! nce 
essentially eliminates the effect of ithe 
acquiescence since the subject must Ww 
one positively phrased adjective from a Pate 
This format, however, may not eliminate se 
effect of agreement acquiescence, if we nre 
sider a tendency to agree with the first ae 0 
tive as agreement acquiescence. Thus, the A 
conditions differ in that the single-adjecuV" 
format has Operative two acquiescence pres 
Cesses, Conse 


ated 


lity» 


the polar format only one, 


— —— M "S 
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TABLE 9 


FREQUENCIE 
EQUENCIES OF ITEM. VALIDITIES 
with Two FORMATS 


peu 
Validity GEETE Numbers of coefficients 
Polar opposite Single adjective 
72-84 | 
3 1 0 
60-72 1 | 2 
:48-.60 A 0 
36-48 22 4 
Eon 76 19 
B 122 90 
Beon an 64 137 
E o0) 10 48 
Quent], 
Jcntly, we would expect the validities to be 


Inh 
Br» the polar format. The frequency 
Speaks eg of validities presented in Table 9 
the à itself, Validities are higher by far 
fee aig format than the single- 
lm yet Eliminating acceptance ac- 
ive pow, e has markedly improved the predic- 

n er of the adjectives. 
because we reported this study P 
Could w 2 know that a content protagonist 
em iro; claim that the increased validities 
e Boe: the more clear-cut presentation of 
Tie in the polar opposite condition : 
«vung that “disagreeable” is the opposite 
agreeable” makes it clearer to the subject 
Ies “agreeable” means., It can be 
Sible f that this increase in clarity 15 respon- 
or the higher validities. Indeed, since 
opin ourselves deeply concerned with de- 
Ne A assessment strategies which capture 
orse "red of personality, we too would en- 
"'Onteni > technique from the point of view of 
alone. We thus agree that content 


clari 

i E 7 H . 23939 

inden contributed to the higher validities— 

iq ccd, we can envision studies aimed at 
tude of content 


ident: 
‘dling the relative magni : 
fop , A cquiescence effects in this situation. But 
ist for present we must ask, Would a pongi 
Ut t the content position alone have e 
Would « study in the first place? And what 
© th a content protagonist predict to happen 
4 € interrelations among the various single 
c Le tes when the total number of adjectives 
mod is taken into account, as in Bentler's 
C 9) study? If some content protagonists 
ould be persuaded to investigate effects such 


aq 
qu 


articularly 


of 
Just 
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as the ones described herein and to become 
interested in the many and varied influences 
which affect the identifiability of personality 
assessment content, we would be very pleased. 
After all, one of the reasons response styles 


are of concern is to provide a foundation for 


effectively tracking down the elusive “con tent." 
t seems to us that 


At the moment, however, i 
a viewpoint that concerns itself with both 
content and response-style variance is more 
effective in generating research on basic pro- 
cesses involved in responses to structured 
personality inventories than is one exclusively 
oriented toward identifying content. In fact, 
it would appear that a content viewpoint will 


even have to struggle to succeed in explaining 


the data we have reviewed here. 
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University of California, Berkeley 

cquiescence offered by Bentler, Jackson, 
entler et al. that their first 
scence (true-responding), is a relatively minor de- 
he MMPI, it is argued that their 
tative tendency to ascribe 
f the direction of item keying), is not likely 
ding the nature of responses to structured 
adduced by Bentler et al. as support for 
s unresponsive to the evidential require- 
ations for clarifying or responding to the 


The two-dimensional interpretation of a 
and Messick is evaluated. Although agreeing with B 
dimension, agreement acquie: 

terminer of responses on inventories such as t 
second dimension, acceptance acquiescence (the pu 
characteristics to oneself, regardless 0! 
to be of appreciable import for understan 
personality inventories. The relationships 
their view are evaluated, alternatively, 4 
ments of their position. Some recommend: 
possible role of acquiescence are offered. 
Meehl (1945) entitled “The Dynamics of 
‘Structured’ Personality Tests” has the field of 


by Bentler, 
t viewed the content of 


The; 
i , 
mmediately preceding paper 


eter 


SERS 
interest? and Messick (1971) seeks to renew r 
M acquiescence as à possibly important personality assessmen ‘ 
ctured per- inventory items in the naive, patent terms 
l. attribute to the “content” 


min: 
Hminant of responses to stru mde 
entler et al. 


as Bentler et al. do, this 


na s 
"rica" 1 Ventories. As reference to Psycho- E 
scence stracts will attest, interest 1n acqui- position. To contrast, al do, 
e has dropped away precipitously since unfairly defined “pure content position with 
y (1965) and their all-encompassing “content-style position” 
is to construct a straw issue; certainly, it dis- 


Boir 
en 

tiques advanced by Rorer 

of the issues that 


Myg 
ysel 
offered (Block, 1965). However, the newly c traw 1 
ent conjectures and evidence advanced by courages serious discussion 
s Mler et al, deserve consideration and re- may be involved. — 
A far better distinction between content and 
Messick (1962) 


ations and also 


at the present 
ntific concerns, 
ctively on 


style was made by Jackson and 
and is cited by Bentler et al. also. The reader is 
urged to note that where Bentler et al. construe 
content in face valid terms, Jackson and Mes- 


sick earlier permitted item content to be 
direct or indirect," so 


1 POnse 

because Because of space limit 

time nus d personal priorities 

ls hall h addressed to other scie 
sq, omment only briefly and sele 


vie 
ews and analyses of Bentler et al. 
j Bentler et al. ex- 


we T Es 1 
blicii n their introduction; : e m 
Y acknowledge the inadequacy of earlier obvious or subtle . - - i 
= nce employed long as response consistency was achieved. In 
to the conceptualization of 


r regard, then, 
ACy has been previously note by diverse “content” and of "style," 8 years have brought 
ag tigators; its P knowledgment by the pro- change but not progress, 1t would appear. — 
the ms of the earlier view closes it chapter n 3. The new proposal of Bentler et al., that it 

"story of personality assessment. Amajor may be useful to distinguish two types of 
c ce (agreement acquiescence and ac- 


On, : 
< Sequence of this acknowledgment 15 that the acquiescen ; : 
ort this faulty ceptance acquiescence), lacks the attractive 

f the earlier, admittedly inadequate 


Nee : Ri 
b isse E of acquiesce mp 
qu ackson and Messick (1958). This inade- 
i 


tudie 
es earlier seeming to SUPP 7 ceptanc 
4: "eptualization of acquiescence are now, DY simplicity of th n 
conceptualization of ncquiescence solely in 
ing. It was only after 


f true-respond 
dies (€-8+ Lichtenstein & Bryan, 


& Goldberg, 1965a, 1965b) 
that true-responding 


i : 
ae of obscure import. . E 
Dog In defining the “content” and “style terms © 
tions, T believe Bentler et al. state the issues reversal stu 
ic article by 1965; Rorer 


end s 
entiously. Not since the class! 5- det 
'p demonstrate clearly 
Teparation of thi eT LEE a 
: his manuscript Was facilitated Dy = 3 “ Š 
: ice Research Grant (agreement acquiescence) must be a relatively 
iner of responses on inventories 


minor determ 
ack Block, such as the MMPI [Bentler etal, p. 1910) 
nce acquiescence was invoked as a 


hit = 
Mg ed States Public Health Serv 
6080-01. 
bu for reprints should be sent to J 
poDartment of Psychology, University of California, 

j that accepta 


erk 
tkeley, California, 94720. 
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ve of an acquiescence inter- 
ctured personality inven- 


try for what Bentler et al. 

reversals because, for 
many inventory items, it is Psychologically 
haive to posit a polar opposite. For example, 
what is the polar opposite of the item, “I feel 
lonely much of the time”? As another illustra- 
tion, what is the polar Opposite of the person- 
ality trait of “dominance”? Is it "submissive- 

s itarianism”? 4 
ls" avoids the 
Possibly are not 
as psychological re- 
ater reyer- 
y negation reversals, 


» 4$ an explanation of an individual’s 
response to inventory items, W hereupon 
Bentler et al. have conjectured, subsequent to 
these findings, that in 


ondents still 


attending * meaning of the State- 


ments affirmed or rejected, 
tis now Proposed th, 
entially respond to 
negations in an inve 
à differentia] 


absence of 
because of 


direction of item 
keying, If the number of neg; 


ations is zero (or 
even), the respondent high 
acquiescence “true” to that item; 


nds to Say “false” 
answer of “false”? ig 
conjunction with th 


to that item 
a second nega. 
€ first negation, 


ecause the 
tion that, in 


* Another reconceptualization of 
terms of variance attributable to the 
Portions of an inventory scale. 4 


this time ( - Psychology and methodo 
response Styles. 


logical Association, June 1965), 
-perhaps becau: 


acquiescence, 
true and the false 


logy of 
at Western pa. 


les, independently constructed as polar 
Opposites of the origina] F Scale, should have correlated 
highly with each other if polar Opposites are easily and 
obviously achievable; instead, the correlation was only 


aracter- 
us " chara 
indicates a positive affirmation of the 


accept- 
An n ac 
Istic involved. The individual low o 


sitely- 
ance acquiescence tends to respond ier 
In both instances, the respondent a prs u 
to be an algebraic purist, BIRD negatives 
rigor with respect to the number 2 ing of the 
present while disregarding the TO he 
root verb or root adjective of the sta 
is endorsing or rejecting. ing to Bentler 
Thus, for example, according tive trait, 
et al. a respondent high on the pnta expect? 
acceptance acquiescence, would be t “I am 
to say “true” to. both the poder am 
usually happy,” and the Staleman both the 
usually sad.” He would say “false” to usually 
statement “It is not the case that I St ue case 
appy,” and the Statement “It is no 
that T am usually sad.” 
Although 
MMPI item. 
confounded 
acceptance 


the 
M 
& content interpretation dd be 


f 
in implications of SA 
With certain implication 


B jit 

ər imp =, 

€ by Bentler et al., beet r 
r position would contra Woul f 
| " 


deep sleeper; and so on? espo” 
that human beings rents | 
to Personally relevant inventory siae 3 
the automatic manner required by the * isst! 
ance acquiescence interpretation. bur be # 
is, or can be, an empirical one, It cope item? 
simple matter to construct quartets oF with 
Tmat illustrated wm 
Nappy” ang Bad” Chen 
aS, Suitably intermixed; ole 
to a suitable subject seen 
Particular response patt 


ce con” 
required by the acceptance acquiescen fter 
jecture is, in f; 


ict, to be seen reasonably direc 
patterns of subjects. M om ace 
aluating the potency 2 ms f? 
acquiescence interpretation ps the 
me to be far more Compelling in regard i 
Sind of evidence i 


in- 
* ne 

t can provide than He ali 
direct and confounded approach Bentler €t“ 


4p view 
ave taken to adduce Support for their 
(cf. below), 


4. Th 
MMPI 
Points 


thë 

e criticism by Bentler et al. ee 

i unproductive. Well-estab!is Ta 
are scored again: certain MMPI sc 


me 
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have 
MMPI cn psychometric properties, certain 
and Ce ree not manifest the convergent 
the E oF f validity their labels require, 
Usually em = factor analysis applied to the 
Actors "hes a ed MMPI scales indicates two 
Variati xplain” much of the pattern of co- 
ut on of these scales. 
tive in conre, NONE of these remarks is posi- 
To tisie v for the influence of acquiescence. 
is educine « the MMPI is easy; more difficult 
Oca] infe evidence for the direct and unequiv- 
in the MM of acquiescence, of either type, 
One, Todes I. This Bentler et al. have not 
9n the ote careful reading of their section 
Where is it iquitous MMPI,” reveals that no- 
i asserted that acquiescence, of either 


ind. ; 
is n : 
» ls one of the two main factors underlying 


MM 
PI scale intercorrelations. Although “de- 
be one of the 


Sir] E i 
sp is explicitly said to [ 
Claimed rene the other main factor 1s not 
Cult to or acquiescence. It is therefore diffi- 
apja estana the intentions or the ac- 
eta], ment of this section of the Bentler 
: argument. 
eon three lines of 
Modine. al. have adumbrated, 
“scence rong conceptualizatio 
find disappointing. 


recent evidence that 
ated for their new 
n of acqui- 


Th 
€ Bentler (1969) Study 


“ee study employs the semantic dimena 
ain dure, thus shifting from the inventory lo- 
rst Wherein the issue of acquiescence, x n 
Been aised and to which it 15 spec E 
Ea ee interpretation would apP — 
i Vol ing to the semantic differential pem 
that es psychological demands on the 5 A A 
inve She very different from those ee "i 
bip tory like the MMPI. Thus, pec oe 
diff y of any result achieved with the sema obe 
erential cannot be generalized to the inven 
ity medium without further research. Indeed, 
9ne's intent is to apply an acquiescence inter- 
Tetation to the inventory domain, it is an 
‘Wise detour to introduce 2 host of unknown 
Variables plus à significant generalization 
Problem, by employing the semantic differen- 
gonane i 
S Rentlet ly; 
"we s as well. It recommits the 
old error of presumns psychologically polar 
Opposites are easily attainable. Further, 


however, suffers from 


p 


Bentler employs as an index of acquiescence à 
re that Bentler et al. admit confounds “in 


sco 
he two types of acqui- 


unknown combination” t 
escence they propose as important to distin- 
But this score, the total number of 
ed, does not confound only 
ted kinds of acquiescence; in 
addition, this index is confounded as well with 
many other proclivities of the subject, for 
instance, the response set of extremeness, other 
response sets, and a host of personality vari- 
ables, all operating in unknown but reliable 
combination. 

Personality assessors over the years have 
come to recognize that scores such as the 
number of adjectives checked, the number of 
Rorschach responses emitted, the lengli of 
TAT protocols, and so on, although reliable, 
are not thereby unidimensional and unequiv- 
ocal in meaning. Differences in weight are 
reliable also. But differences in weight may 
be due to differences in age, in sex, in genetics, 
in the number of amputations experienced, in 
nutrition, in time of day, in body fluid loss, in 
wearing of clothes, and so on. To view 
an index of limb loss (after all, 
average weigh less than non- 
bination” with 


guish. 
adjectives check 
those two postula. 


the 
weight as 
amputees on the 
amputees) "in unknown com 
clothes-wearing tendencies (after all, people 
who wear clothes, when weighed, weigh more, 
on the average, than people not wearing 
clothes) is to ignore the primary sources of 
variance underlying the weight index in its 
usual context of measurement. If large sources 
of the variance are unrecognized, or the size of 
the contribution of other variance sources can- 
not be assessed, the interpretation of an index 
in terms of small or unspecifiable variance 
sources severely misrepresents its meaning. 
Positive evidence for interpretation in terms 
of acquiescence—of either or any kind—re- 
use of direct and unconfounded 
not simply a demonstration of the 
“existence of noncontent responding” using an 
index whose variance may be accounted for by 
myriad sources of large and unknown value. 
Phrased more vividly, a proof that someone 
is not in Berkeley is weak evidence for the 
assertion that he is at the North Pole. 
Incidentally, the discovery by Bentler that 
f the influence of individual dif- 
h respect to the total number of 


quires the 
measures, 


removal o 
ferences wit 
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Dt errors of rating or of 
Checking, Although Guilford’s discussion jg 
phrased in ter i 


ms of rat 


pplies directly. 
atively explains the Phenome- 
non construed by 


The M, orf and J, ackson Study 


Many aspects of the Mo 


Press) Study Warrant comm 


Concerns are noted here, 
First, Morf and Jackson 
their Study sought (o exclud 

trary scales z 
tory items 


rf and Jackson (in 
ent, but only three 


at the outset of 
e from the arbi- 
ituting all inven- 
y known to have high 
i Subsequently, acquies- 
cence of both kinds was found. 


is design decision astonishes me for it 
ut in order to “fing? acquiescence, 


tificially con- 


under 4 


Convenient street light rather than in the place 


Where the wallet should be found. 


Second, the inventory scales defined by Morf 
and Jackson, especially the facet-representing 
as noncoherent and as arbi- 

S only one of Many possible 
er the items “T am not likely 


5 Jackson ang Lay (1968) formed inventory scales 
using the same item Pool employed by Morf and 
Jackson, but did not exclude items having high content 
Saturations. They fi 


und almost no acquiescence, of 
either kind, 


ita 
: " rhowlt | 
to take a watch apart just to discove 


ues- 
works” and “I never seem to run out M 0 
tions on topics that interest me. NT scale 
these items were assigned to the va toward 
Supposedly to reflect an jy met ex- 
understanding many areas of know negative 
pressed via self-description, using viso. Be 
wording, and keyed for a (rue respo eflect, Ï 
first of these items seems to me to "ind ex- 
anything, a tendency opposite to „uct vali- 
pressed by the second, Since no consi is aske 
dation of these scales exists, the ced Jackson 
to accept on faith the labels Morf an re most 
have provided for these scales. I Zu ge fication 
Psychologists wil] require greater jus 
of these scale names, +h Morf and 
Third, the factor analysis on which psc o 
Jackson rely for the claim that two war in 2 
acquiescence exist is technically e the 
fundamental way. It will be recalled, tha 
equations of fundamental factor € 
the reliability of 4 measure sets an poen 
to the communality or Proportion of cot given 
factor variance of that measure. For $ xcee 
measure, communality cannot logically € 
reliability. 
Reference to T. 


article reveals th 
32 fa 


al. 
er ct & 
able 6 of the Bentler f the 


M 
nce of these crucially ees 
reliable). The reliabi : 
to .51. Further rere 
that the mean cnnan he 
Scales was .39 (i.e, on 


tant scales was 
ranged from —.05 
their Table 7 shows 
of these same 32 


as 
z ale w 
average, 39% of the variance of Ee * the 
“explained? by the factor analysis). 39) 
communal vari 


ance, then, only 72% (er 
de reliable, Especially embar 
i discrepancies abound (e.g., a commit y 
of .45 when the reliability of the scale is € 
-10!), 
I do not know wh 
ance, ther 


act 
at it means to pu 
n “interpreting” the not 
Bentler et al. do Po 
at they see between e. 
ation of chancy Variables and the 


B M- 
Ypothesizeq target matrix can well be 


M" G 
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M 
Dus Cis the light of the investigations by 
and M 2d and Humphreys, Ilgen, McGrath, 
Montanelli (1969). These investigators 


actor a si 
analyzed random variables and have 


Show 

D dismati s 

dramatically how complaisant factor 
hypothesized 


ünalvsi 
EIS Minds to be in fitting hesi 
9f Horn pus The findings and cautionings 
and have di umphreys et al. are compelling 
analysis; th ISINAYINE implications for factor 
€ attrac ey must be heeded by all who may 
o ected to the method. 
rn's summary is worth citing here: 


Seve 
Benerated Ey random variables for an AN of 300 were 
9t Variable. Morf and Jackson used an equivalent ratio 
arbitrarily and subjects, 51 variables and 195 subjects], 
into à ed named, intercorrelated, factored, and rotated 
esis facto, (least-squares) approximation to an hypoth- 
to be qui matrix. The resulting factors Were found 
Counts i interpretable and to have high hyperplane 
Were be though communalities and factor loadings 
Bator wer T here was thus indication that if an investi- 
e re willing to interpret relatively low loadings 
Bather ‘Seemed to “make sense,” he needn't bother to 
arbitrari il data: random variables may be labeled 
“good ly and pushed into solutions that make quite 
Sense" [Horn, 1967, pp- 819-820]. 


Th 
^e Bentler and Marshall Study” 


As Study demonstrates that a single-adjec- 
Olar- rmat for description 3S less valid than a 
e opposite dimensional format that anchors 
hun E of both ends of the dimension. 
Sola v et al. “know that à content protagonist 
Stem well claim that the increased validities 
from the more clear-cut presentation of 
ANE in the polar-opposite condition Ts 
he J agree that content clarity contributed to 
is Ted validity." They ask, however, 
alo > d a protagonist for the content position 
ne have carried out this study in the first 

Place?” » 
bee protagonist would h 
s question, but for a reas 


in his original analyses 
nse sets: True 


ave to say “No” 
on different from 


5 
Inge Run. pp. 73-74), 
respon, urs E. An analysis of two respo ed 
ora] ane and item endorsement. Unpublis hed doc- 
1969) dissertation, University of W estern Ontario, 
Cquie could not find a factor representing acceptance 
male iescence when he partitioned his subject sample into 
fie and female subsamples, despite efforts to maxi- 
whol fit to this hypothesis and despite the strong part- 
mur relationships between his subsamples and his 
e sample. 
_ 7 Bentler, P. M. and Marshall, M. Effect of presenta- 
tem validity. Unpublished manuscript, 


tion format on item Ve 
niversity of California, Los Angeles, 1969. 
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the one Bentler et al. would prefer. He would 
have again remembered, for example, Guil- 
ford's chapter on rating scales (Guilford, 1954, 
Chapter 10), and known that greater di A 
nation and lesser equivocality result from 
anchoring both ends of a response continuum. 
Accordingly, greater validity will be found for 
a dimensional format as compared to a single- 
adjective format. Further demonstrational 
studies of the advantages of score unambiguity 
and of increased discrimination are matters 
of low priority in the field of personality 


assessment. 

6. My remark: 
critically, on the Bent 
tion and evidence. Some 


are in order as well. 
Direct, unconfounded, construct-valid mea- 


sures of each of the newly proposed aspects of 
acquiescence are required. This is a long and 
difficult responsibility, but it must be assumed 
if the two-dimensional interpretation of acqui- 
escence is to become more than a post hoc con- 
jecture. Further research employing admittedly 
confounded indexes will not be helpful. 

It will not be fruitful to study acquiescence 
until the nature of the interdependency claimed 
by Bentler et al. to exist between the two kinds 
of acquiescence, and between content and each 
of the two kinds of acquiescence, is specified 
or shown to have lawful parameters.Presently, 
no conceivable result can embarrass this all- 
embracing interpretation because the princi- 
ples of interdependency have not been stated 
prior to and independent of the relationships to 


be evaluated. 
When direct and unconfounded measures of 


have been developed from differ- 
these measures should be corre- 
hat, indeed, acquiescence is 
disposition. This has only 
he past, and, when tried, 


s thus far have been focused, 
ler et al. conceptualiza- 
positive suggestions 


acquiescence I 
ent domains, 
lated to establish t 
a broad response 
rarely been tried in t 
has offered only disappointing results. 

In the effort to develop compelling evidence 
for acquiescence, further usage of the method 
of factor analysis as it has been employed in 
the past would appear to be unprofitable. 

Until positives systematic evidence, based 
construct-valid indexes, is developed 
to the importance of agreement acqui- 
] acceptance acquiescence, investi- 
a choice of three alternatives. They 


upon 
attesting 
escence anc 
gators have 
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can ignore the Possible influence of acquies- 
cence, usually with no particular danger. Or, 
and better, they can adjust their indexes yi, 
techniques such as the Guilford proc c 
partial correlation to eliminate the relation- 
ship-obscuring effects of the specific variance 
introduced by diverse subjects. Or, 
they can prevent at the outset thi i 


edure or 


be uninteresting or 
i her, it may be 
or to prevent, 
9r procedura] 
Such response 
Occupied with 


trivial or idio 5 
quite sufficient sin j 
by Suitable 


Means, the obscuring effects of 


Styles rather than become pre 
them, 
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The tendency for two variables 
sampling occasion to the next is 
directional correlation (d coefficient. 
association, especially rank correlatio! 
directional correlation may diverge £ 
correlational statistics will suffice in 
tional correlation present, they wil 


modifications involve consi 
recommended when directional correl 


s, aon between two rcm 
Cattell’s (1952) t — dre op 
oiber statisti x n le a ox ‘another corre- 
pe on coef cie p ki oe association may 
Pe eng icient, One kind of assoclauio® 
telice Id however, that is not a 
erm " by usual correlational techniques. E 
enden inectianal correlation is given L. the 
the sa cy for two variables to change values in 
[n een direction from one sampling occasion 
üble en 1f, for example, increases n m 
incre; are accompanied for the most part by 
ases in variable Y, and decreases in 2 
eei ga in Y, then one can spe of € 
We of positive directional correlation 
en variables X and Y 
es Es possibly m! 
c osel. : irectional correlat 
Produ. other kinds of as vp 
Bib Ge ch and, particular Y ran 
d irect When this is not the case, & m 
a mez ional correlation may be of valu 
asure is easily derived. 


time-series situa- 
ill parallel rather 
such as 
k corre- 
asure of 
e. Such 


ost, 
jon W 
association, 
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on : 
n successive occas! 


Ip: 
lite his work was initiated i 
ratory of D. T. Lykken, 


t to Robert 
of 


to change values in the same 
termed directiona 
) will often parallel oth 
n. In time-series (P-type. 
greatly. Although modification 
some of these situations to re 
] not be generally adequa! 
derable effort. The e: 
ation is a phenomer 


direction from one 
1 correlation. 'The coefficient of 
er measures of bivariate 
) situations, however, 
s of traditional 
flect the direc- 
te; moreover, the 


asily computed d coefficient is 
non of interest. 


made of the (GY) values of each sampling 
occasion after the first with the (X, Y) values of 
the immediately preceding occasion. A plus 
score is given for each instance in which the 
direction of X is the same as that of Y (both 
variables increase in value or both decrease), a 
minus score for each instance in which the 
direction of X is opposite that of Y (one vari- 
able increases, the other decreases), and zero 
or no score for each instance of tied data 
(either or both variables remain unchanged). 
The coefficient of directional correlation (direc- 
tional or d coefficient) is then defined as 


= p= i 
d pm F Eü 


mber of pluses, and 7 the 
If there are no tied data, 
d can equivalently be 


where p is the nu 
number of minuses. 
ptm=n— 1, and 


written as 
—m 
dci [2] 


n—1 


nt that Equation 1 in effect omits 
atistical consideration. In 
h omission is not desired, 


be used. 

As do most other correlational statistics, d 
1 through 0 to +1, these values 
g, in turn, perfect negative directional 
correlation, he absence of directional corre- 
lation, and perfect positive directional cor- 


relation. r . 
The following 1S an illustration of the compu- 
tation of d. The six bivariate observations are 


listed from top to bottom in order of sampling. 


It is evide 
data from st 
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Y 
6 4 
8 5 F 
2 3 s 
7 1 = 
7 2 0 
9 5 3h 


nuses, 
and zeroes for each variable Separately, as that 
Variable Increases. decreases, and remains 
Constant in value, then calcul 


Some STATISTICAL RELATIONS 


er Statistics, 
Mas’ (1946) 
> 7na, though the 
independently 

related to 7 
ank correlation, 
at, in the case 


immediately; 
_ D — 
ep PCR 
= n(n — 1) 
2 


Where band P 
and 3f 


for the 
OWwever, in- 
all possible pairings of the 
ple (nfm — 1]/2in 
S scoring for d involves 
ordered Subset (y — 1 in number), 
n the general case of tied data, d is directly 
analogous to Y, Goodman and Kruskal’s (1954; 
see also H; 


ays, 1963) variant of T, rather than 
to Kendalp’s tied-data expression: 


P—M 
Pao YA 


d —P-—m 


[4] 
= — P M 


E= re 


aje] 


izes 
~ r ecogni^ 
(It should be"noted that ppt BOR im 
tied-data situations in which ei 4, is the 
Equation 3, rather than Equati 
appropriate Statistic.) tween direc- 
A more fundamental relation be bé stated. 
tional and rank correlation pee ei 
When sampling randomly from Sik of tht 
Population, 7 is an unbiased es efficient, f^ 
Population rank.correlation co Iso is an UP 
Under these same conditions, d a more infor 
biased estimator of TAT magne 
mation from the sample and peri e hand, 
the more efficient statistic. d, on th increasing 
is considerably easier to calculate, 
SO as 2 becomes larger.4 


d EIE SE 
TIME-SERIES CA orms) 


ious ft 

The likeness of d and + (in its A 
has been shown in the previous i arises, 7? 
difference giving point to this s aar] where 
noted earlier, in the time-series cone dom; an 
sampling is temporal rather than parr than 
observations generally dependent pe case § 
independent, 7 and 7 might in this ight als? 

€ comparable in value, but they m “obse! 
be quite divergent, It is possible to „riable’ 
strong direction 


: aen Vä 
al correlation between 


be 
haps or 
* That dis an unbiased estimator of 7’ it ova 
seen directly, A proof requires that the nd toT: 
all possible random samples of size n be eq 
can be shown that each of the 


NO eon 
7 GY) 


"RN n 1 E an in 
pairs in the population of size I is scored i 


2(1—1) (y —2) 


(N —n)! 
9f the 
N! 
(N —n)! 
Possible r. 


andom sam 


b 
hen 
ntl 
ples. The mean of d ca 
Expressed as 


n— 


M [26-nar-2i N! ] 
1 N-n 


(N—n)! 
Which reduces to 


iot 

' : etio; 

» e Pertinent reference, discovered after comp js 

(1945) manuscript, js wie Paper by Moore and tial? 

direction PP work deals in habe With what is essen 
Tectional Correlation, 


ee hl 
ka J EES eee 
SE ee -—u 
Ww» 


DIRECTIONAL CORRELATION FOR TIME SERIES 


puse rank correlation is negligible or even 
mes A SIBI and vice versa. The source of 
to ae resides in the fact that d tends 
late short-term (phasic) association to the 
ation e neglect of the long-term (tonic) associ- 
present, whereas the reverse holds for 7. 
us : of Figure 1 are extreme examples con- 
ares 2 (with no ties, as is true of subsequent 
Bes to illustrate this differential sensi- 
NU these two components of covariation. 
"mpling points for the two variables (X and 
) are represented by closed circles. C of Figure 
for example of d-r disparity caused pri- 
time 3 f by a nonmonotonic association with 
anc he One of the variables. D illustrates yet 
ther kind of time-series situation 1n which 


MAGNITUDE 


A B 


d-100 d= 400 


T- -50 
i= -69 
(= -87 
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directional and rank correlation differ mark- 
edly. 

The significance of the phasic-tonic distinc- 
tion is further illustrated by considering the 
effect on the examples given of decreasing 
sampling frequency by a factor of two or more. 
Most dramatic is the persisting reversal of d's 
sign in A and B. d is now reflecting (even more 
strongly than 7, as it happens) the strong tonic 
relations between variables. Insofar as the kind 
of association revealed depends on sampling fre- 
quency, that factor is clearly of concern to the 
investigator. (Also at issue in some situations 
is whether sampling is to be strictly temporal— 
for instance, every 10 seconds—or to bear some 
relation to whatever stimuli may be present.) 


E D 


d=100 d= 100 
FB F4 
r= 30 f= 52 


f= 52 


fey.t =1.00 y-t = -1.00 hy-t :.39 lyst = 51 
faxay 7100 xay = 100 Iaxay SH xay :.59 
observations. 


Fic. 1. Four hypothetical examples of bivariate time-series 
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hree common correlational statis- 
5 r, Spearman’s rank-correl 
and Kendall’s ; is refle 
onal correlation present, it 
that some modification of 
atistics, such as 


ation 
cting 
might 

these 
Partial correla- 


product-moment correlati 
In certain instances, b 
tional Correlation will be 
or Product-moment corr vith time as a 
Partialed-out third variable. The instances are 
those in which Associations between time and 
the basic variables are substantially linear (r) 
or monotonic (7). (There is no partial counter- 
Part of p.) A and B of Figure 1 are examples 
in which partial Correlation technique is quite 
effective (due to the Strong linear associations 
wit time); c are examples in which 
i (due to the absence of 
1c—associations), 
ach to approximation of 
ation involves change Scores, 
ables are AX; = Xin- x, and 
than Y; and F; 


ampling, ranges from 1 
sly, d isa 


on. 


Here the vari; 


d, but in C and D, 
Sruence—especially in D. The 

'viation is apparent: 
change values Consistently 


direction, the Magnitudes of 
versely related, 
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, by using the 
assured an adequate measure only ee Ts far 
d coefficient. Moreover, that coef statistic: 
easier to compute than an "alternativ hi coefli- 
(A d-like index can be calculated oF abe o 
cient fashion using the same n roper. Al 
plus and minus Scores—utilized by posu this 
though yielding generally — “ably more 
alternative statistic involves er appear to 
Computational effort and does ay 
have any compensating advantage. order. The 

Two concluding comments arein aal coefli- 
first is reiteration that the pene 
cient is Presented and discussed asa aplications 
Statistic. (On the inferential cane 0 
Stemming from the general deper instance 
time-series Observations see, on secon 
Cattell, 1963, and Holtzman, 1963). .s no brie 
is the caveat that this pud coefli- 
for the general utility of E some 
cient. Although the Statistic has al conte* 
application in the "hien ge be see” 
in which it was developed, it remains elation ! 
the extent to which directional Core pet wee? 
distinct from Other kinds of relation which 
time-series variables, and the degree to 


rS. 
m N investigato 
that distinction 15 Important to investig 
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Intercultural adjustment is des 
or culture shock. However, very 
approaches for explaining or un 
patterns of behavior and personality 
plaining intercultural adjustm 
situational factors have been 
proaches can be more meanin; 
degree that they facilitate or 
cultural communication appear 
understanding the adjustment o 


trait: 


Ds The purpose of this article is to discuss social 
oi chological factors related to the adjustment 
Oblems of the sojourner. Books and articles 
y Cleveland, Mangone, and Adams (1960), 
kroth (1968), Gardner (1962), Gullahorn and 
iullahorn (1963), Hall (1959), Hall and Whyte 
Mai ), Jacobson (1963), Kelman (1965), and 
soj gpr * have discussed various aspects of 
urning and adjustment. 
ies term sojourner refers to many types of 
Tn including students, trainees, technical 
Son dn s vi businessmen, military per- 
tofe ; Missionaries, foreign service officers, 
Dima nt and others. The sojourn research has 
has arily considered the foreign national who 
sojourned in the United States, although 
a appears to be an increasing interest in 
American who has traveled to other 


Ountries (e.g., Major, 1965, see Footnote 2; 
mith, Fawcett, Ezekial, & Roth, 1963). 
e dealt 


sive among those studies that hav 
dis the American sojourner, there has been a 
E onmia emphasis on the study of 
tö pus and educational advisors. According 
atterson (1969) and the United States 
Ureau of the Census (1969), pleasure and 
Personal reasons were indicated by approxi- 
"uel 78% of all United States passport 
Cipients in 1967 and 1968 as the two main 


D, Requests for reprints should be sent to Kenneth H. 
avid, P. O. Box 856, Peace Corps Training Center, 
Diversity of Hawaii, Hilo, Hawaii 96720. 
.. Major, R. T., Jr- A review of the research on inter- 
National exchange. Unpublished manuscript, Putney, 
ermont: The Experiment in International Living 
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cribed, in general, by means of a W-curve function 
little apparent integration exists among the various 
derstanding the sojourner's adjustment. Social 


s have not been very successful in ex- 


ent, while social interaction and background and 
of somewhat greater value. These divergent ap- 
fully understood in relation to adjustment to the 
impede intercultural communication. Thus, inter- 
s to be both an integrating and crucial factor for 


f the sojourner. 


purposes for their travel, Vet, these individuals 
have been relatively neglected by the sojourn 
research. Another emphasis of this kind of re- 
search has been on the study of the traveler 
during the period of his stay in the foreign or 
host culture. There have been relatively few 
studies of the adjustment of the returnee to 
his home country, the permanence of the effects 
of the sojourn, the effects of the sojourner on 
his hosts, and the effects of the returnee on his 


own country. 

Because of the increased opportunities for 
international travel, more and more persons 
from many countries are interacting with one 
another. With such increased frequencies of 
intercultural interaction, the sojourner is faced 
with a number and variety of adjustment diffi- 
culties which can be traced to both the obvious 
and more subtle differences between cultures. 
The major purpose of this article, therefore, 
is to discuss the social psychological aspects of 
the adjustment of the sojourner and to provide 
an integration of the research concerned with 
this topic. 

Adjustment problems can have serious con- 
the sojourner. Thomson and 


sequences for 
example, reported that 


English (1964), for l 
adjustment problems constituted over 60% 


of all prematurely returning Peace Corps 
Volunteers prior to 1963. The problems of 
adjustment are also seen to range widely in 
extent, focusing on the sojourner's relationship 
to himself, to others, to the environment, and 
to his activities within the host culture. 
Smith gi Rei discussed the 
adjustment difficulties that foreign students 
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Cace Corps Volunteers in relatio 
y | their physical well-being 


n to their jobs, 
locations, 


; and their in-country 


| A major problem in 
of the adjustment of th 


tionships, however, he development of 
understandi h 


ost and visitor, 


by the authors as being crucial 
ment of such understanding, 
integrating * seemingly 

Approaches to understanding the 
Process, therefore, is to interpret 


within the broad framework of intercultural 
communication as outlined above, 


Manner, a Variety of factors may be seen to 
influence the aq; 


CJustment process insofar as 
they affect the Process of information exchange 
9r communication 


rgent 
adjustment 
adjustment 
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iT FUNCTIONS 
U anv W Curves OR ADJUSTMENT FUNC 


at the ad- 
The sojourn literature indicates mon pre- 
justment of the traveler seems to E s seems 
dictable temporal pattern, That IS, "i adjust- 
to be a curvilinear relationship ipeum Lys- 
ment and time sequence of the i drin Ful- 
gaard (1955), in a study of beni 
bright Grantees Studying in the Unite ion an 
s curvilinear functi n. An 
as the U-curve phenomenon 
Sewell and Davidsen (1956) reported a aent for 
i academic personal adjustn Unite 
Scandinavian Students visiting em only ? 
States, Initially, researchers reporte rited to 
Curve, because the Studies were E hos 
the sojourner’s adjustment within Jopme? 
culture; however, a more recent ient. be : 
Suggests that a W-curve function Be ajust- 
more comprehensive description of paier j 
ment process, The curve 1s eS > to 9 
an extension of the single-U curve a W 
double-U curve (UU curve), that d been 
curve of adjustment na 49631 
and Gullahorn "ner! 
both the er re- 
adaptation to a foreign culture and n the 
adjustment to his home culture. Ras aD 
-curve function represents the adjustm a 
the sojourner along a temporal Houma t 
4 Very general Manner, the sojourner by after 
undergo a decline in adjustment cher thy by 
entering a foreign culture, which is ate i 
a recovery Stage with a resultant gis s0- 
adjustment; then, on returning home, tust 
journer undergoes another decrease in a ced 
ment followeq by a second stage of denn e- 
he degree and duration of the adjustmen end 
cline for an individual Sojourner would oe 
on a number of Variables, which are discu the 
i er sections of this article. Thus S in 
scriptive of the GHE a 
a larger number of sojourn“ . 
may 
among individ 
instances a 
fully recover f 


t encompasses 


al though there 
ability 


ip 
no appreciable decteest ed 
^ Some sojourners may $e 
ew adjustment difficulties upon a 
Ome, while others may undergo 
Te reentry crisis. 


lso 


—————— 
‘ a 
a 
a 


" 


During the initial phase of the W-curve phe- 
Nomenon, which may last for several months, 
Spending on the length of the sojourn, the 
Sojourner shows a relatively satisfactory adjust- 
Ment to the host country. The visitor is mainly 
à spectator, absorbing the sights, forming im- 
Préssions, and expressing enthusiasm. His 
interactions with the host nationals are quite 
limiteq and uninvolved. Tourists or short-term 
Sojourners are usually characterized as rarely 
eating from this superficial level of inter- 
‘on with host nationals, perhaps because 
be Purposes are related to recreation, re- 
any Sn and sightseeing, which may preclude 
Mer un involvements with the hosts. A 
ase in the sojourner’s adjustment, that is, 
hae slope in the W curve, begins after 
een initial spectator phase. The sojourner um 
nco In the host country for a period of time an 
adi unters gradually increasing demands to 
JUSt to a new and often strange environment. 
© sojourner’s previously limited involve- 
‘ents with the hosts no longer suffice to satisfy 
nce eds to cope with his intercultural experi- 
Of a He attempts to unravel the complexities 
‘ele © culture, and to accomplish this he de- 
bu the need to establish more enduring 
Br relationships with the hosts. He fige 
hoses that his elementary knowledge of the 
anguage is no longer adequate enough to 
€ with the increasing complexities of social 
nep nication. This is a time when the werd 
bons involvement. with the host culture. e- 
n Nes critical; he is at the point where he bi 
othe need of satisfactory relationships wit 
rs, yet, he lacks the social tools or means to 
"E these about. This is the low point of the 
a where the characteristic eyinpiome oF 
(1o Ure shock occur, as described by a = 
a 60) and others. After this, a gradual - 
Very in the sojourner's adjustment occurs (an 
Ward trend of the W curve); where he begins 
hos come aware of the more subtle of d 
t culture and begins to develop a fluency In 


i E language. He develops à strategy for work- 
ng through various intercultural difficulties, 
à Yen the sojourner returns to his home cul- 
ture he may undergo what has been termed a 
m a 
reentry crisis" which is analogous to the first 
drop in ‘the W ’ urve. His decline in adjustment, 
©Wever, seems to be somewhat less in intensity 
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than during his initial culture shock reaction 
to the host culture. 

The sojourn literature indicates that the W 
curve may show cultural variation. For ex- 
ample, Pool (1965) reported that among foreign 
students to the United States, those coming 
from countries that are relatively more similar 
to the United States, that is, other Western 
nations, differ in their adjustment patterns 
from those students from countries less similar 
to the United States, that is, Asian or non- 
Western nations. Sewell and Davidsen (1956) 
reported that language facility, social inter- 
action with Americans, previous contact with 
other cultures, and personality characteristics 
may influence the rate that a sojourner to the 
United States passes through the various 
phases of the W curve. 


CULTURE SHOCK 


Rather than describing adjustment strictly 
in terms of curve functions per se, other re- 
searchers have portrayed adjustment in verbal 
terms as a series of sequential stages. Although 
differing with respect to numbers and kinds of 
stages, culture shock, role shock, language 
shock, culture fatigue, and reentry crisis all 
bear in common a close resemblance to one or 
more aspects of the U and W phenomena. 

Hall (1959) defined culture shock as “a 
removal or distortion of many of the familiar 
cues one encounters at home and the substitu- 
tion for them of other cues which are strange 
Lp. 156].” Oberg (1960) is generally credited 
for having introduced the term in one of his 
earlier articles. He conceived of culture shock 
as an "occupational disease" of people who 
suddenly find themselves located in a culture 
very different from their own. Oberg stated 
that culture shock is precipitated by the anxiety 
that results from losing one's cues to social 
interaction. The cues include the many ways 
in which one orients himself to daily life: when 
to shake hands, when and how much to tip, 
when to accept invitations, when to take state- 
ments seriously, etc. The cues, which may be 
customs, gestures, facial expressions, or words, 
are primarily unconscious and are as much a 
part of one's culture as the language. 

Oberg described Several stages of culture 
shock: (a) Stage 1 is a period of incubation, 
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journer may feel highly 
elated. He may be living in a good hotel where 
Conditions are comfortable, the sights are 
intriguing, the hosts are courteous, and his 
expectations for the future are positive, During 
this period he may find à house, maids, schools 
for his children, etc, (b) Stage 2 is 4 period of 
Crisis resulting. from the genuine difficulties 
that the Sojourner may begin to Encounter in 4 
different culture. The activities of q 
that had been Previously taken 
become insurmountable problems, He may 


associate with other Sojourners and criticize the 
Ost country and its 


nationals ay be described in 


aily living 
for gran ted 


3 is a period of recovery in whi 
Degins to understand some 
host culture. He regaj umor, and 
© may even im authority on 


€ complete or 


"disease? model that Oberg used to describe 
culture Shock, H; 


basic ahorn 
(1963) equated culture shock 
issonance that results from the di 
between the Sojourner's e 


and his Observations of that 


tages of 
(a) A Stage of fascination in 


are various barriers Preventing 

€raction with host Nationals, (b) A 
llity which occurs as the 

he residence develops. There 

social inter, 

the sojoury 


perman- 


is more 
action within the host country, but 


her €mphasizes the superiority of his 
own culture, (;) 4 Stage of improved adjust- 
ment, ssion of humor and lessen- 
A stage of biculturalism, 
rner develops an under. 


with an expre 
ing of tensions. (d) 
in which the Sojou 
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- in ac- 
Standing of the host culture E e 
cordance with its norms. These ie ls 
be similar to those Proposed by m have 
The crises of Peace Corps kw m the 
been described by Pearson (1964). to explain 
first crisis the volunteer is ccm believes 
that he is “none of these things. o his hosts 
that he must explain his purpose i secon 
and dispel their misconceptions. anoni the 
crisis is one of acceptance of the situa oluntect 
romance and adventure are gone; the oie 
becomes bored, lacks intellectual m o 
and his work becomes tedious. He Jitude.” 
read more and discovers a “society of sif oy 
The third crisis is one of se hija cate ty 
frustrations, There is a feeling of iden 
an awareness of the “different life. first-time 
According to Foster (1965), im 
longer-term Sojourners recover 6 months, 
culture-shock €xperience in less than ra yeah 
and it is not uncommon for it to last fà recove! 
although a resilient sojourner may hat one 
within 3 months. Foster suggested t muni 
culture-shock experience will not im futur 
one against future shocks, although the 7€ 
experiences may be less severe, and 
adjustment Stages Somewhat shortened. pe the, 
Smalley (1963) Considered language to use 
Primary determin dium ° 
of the importance of language as a rad f 
communication. He added that the ok for 
anguage per se may result in an acute s ee wi 
Some people, Smalley appears to dien eel 
all over the relative importance of i . em 
types of communication, with Smal 7 Hall 
Phasizing verbal communication, and 
Stressing the nonverba] aspects, a tech- 
In an article about overseas prier yole 
nical assistants, Byrnes (1966) described eure 
shock” ag being Somewhat distinct from i was 
shock. Unlike culture shock, role RH en 
reported to be more Severe, to increase mid- 
time, to frequently reach its peak at the anc 
Doint of the expected length of the soie 
to rarely result in Complete TEGONELY: to 
Shock for technical assistants was believe na 
result from ambiguity in their professioni 


ir 
Toles, unsuccessfy] relationships with p 
St-country Counterparts, and problems Ww 
Communication and participation in the ho 
Country hierarchy 


an! 


a 


P UN 


O 
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b a (1966) suggested that the term 
IScription eU may be a more appropriate 
olunteer : the reactions of Peace Corps 
inum e hilippines. The hospitality 
| With the fairly of the Filipino, in conjunction 
s Wareness of T CORO use of English and the 

' the An certain Western values, may result 
Philip encan Sojourner believing that the 

| oy ime culture is very similar to that of his 
ities E ver, in spite of these apparent simi- 

| vell as the adoption of some Spanish 


Ameri 
rican c C 
Main an customs by the Filipinos, there 


Country many elements that are unique to that 


Bay reg 2 apparent cultural similarities 
fusion E t in the sojourner experiencing COn- 
long e annoyance, and when these frustra- 
| be tern Ure over a period of time they may 
0, in culture fatigue." Szanton (1966), 
Cop SCussed culture fatigue among Peace 
fidere olunteers in the Philippines. He con- 
that re culture fatigue to be the exhaustion 
Adjust Sults from the endless number of minute 
fore Ments required for long-term living In a 
| 80 culture, 
De s returning to his home country, the sojour- 
Sh med undergo what has been termed a re- 
| Con Crisis, The crisis of the returning Peace 
(19, PS Volunteer has been described by Pearson 
and Stolley (1965). The volunteer, 
"S lived for 2 years in a developing 
tY, may find difficult his readjustment to 


Clative affluence of the United States and 
alues espoused by 


ha 

Co 

t 

th 
the mo tingly superficial v e 
| “Ul tie ass media. Moreover, he may have di » 

Vol, 5 In adjusting to a change in roles. As à 

assumed many respon- 

le at home, may Im 
uation with 
nt feelings 


nd resulta 
lessness. The return- 


Olunte, 

“biti, ter he may have 
Ms 2 but, after a whi n 
iE ina relatively ambiguous Sit 


hi 
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of ya responsibilities a. 
in ISillusionment and aim 
has. Volunteer is confronted with many be- 
thie that prior to his departure trom, the 
No “hs States were primarily pee et 

Dur cd unquestioned, and taken od Ere : 
tha, ct returning home and becoming aware o 
| 4 “Se previously accepted pehaviors, these may 
* zn meaningful. The apparent 
he heavy traffic, ubiqui- 
N about, etc., may be 


vap P"Ser seem 
to ues as reflected in t 
* Ve S television, rushing n : 
SN ty difficult for the returnee to reconcile with 
| he acquired while in the host 


P. Values that I i nai 
Untry, for instance: values of social concern 
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which he acquired as a result of teaching, pre- 
venting illnesses, or increasing food production. 


SocraL PATTERNS, PERSONALITY, 
AND ADJUSTMENT 


Various investigators have attempted to 
construct sets of characteristics and distinct 
social patterns that could be used to distinguish 
between groups of sojourners. There may bea 
limited number of alternatives that a sojourner 
may choose from to cope with a foreign en- 
vironment, and each alternative would include 
a characteristic modus operandi. Once the dif- 
ferent social response patterns have been de- 
termined, a fairly accurate description of the 
social behavior of any sojourner, after he adopts 
one of the characteristic patterns, is possible. 
For example, Shelton (1964) observed a charac- 
teristic pattern of social interaction, which he 
termed the “Miss Ophelia Syndrome,” among 
a group of investigators conducting field re- 
search in Africa. The syndrome consisted of an 
avoidance of physical contact with the Africans 
and avoidance of accepting Kola (nuts), food, 
or wine from them. The usual reason, or 
rationalization as Shelton perceived it, was that 
the investigators were acting in a scientific 
manner of detachment which was appropriate 
for conducting their research. However, the 
Africans, as well as other host nationals, are 
likely to have interpreted the aloofness of the 
researchers in a manner that may have 
jeopardized the probability of the researchers’ 
success. The villagers may have believed, for 
example, that the “Miss Ophelias” considered 
themselves to be superior to the Africans. 

Sewell and Davidsen (1956) described four 
kinds of sojourners and the characteristic pat- 
terns of adjustment that they adopt: (a) 
Detached Observers, (b) Promoters, (c) Enthu- 
siastic Participants, and (d) Settlers. Sewell 
and Davidsen added that these patterns of 
adjustment are related to the sojourner’s 
perceptions of roles and their expectations with 
respect to returning home. 

The Detached Observers were described as 
having very little involvement or need for 
involvement with the host country, Their 
motives for the sojourn are usually technical 
and clearly defined, and they form friendships 
with fellow sojourners rather than with host 
nationals. They do not show severe adjustment 
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problems, either q 
turning home. 

The Promoters wW 
low to moderate 


uring the sojourn or on re- 


ing 


ost 


e countries. 
The Enthusiastic Participants were described 
as detaching themselyes from their home 
country and actively p in the host 
culture. The 


Xperiences, Their 
are not exclusively 
have 

the 
; their expectations of the 
not clearly 
crises, 

as showing little 
heir home country. They adapt 


» In time, show a genuine and 
thorough adaptati St country, 


In India 


echnical 


a high degree of self- 
esteem, although early i i 


they have as many problems 


Merican wives, Tf they are not 4 
their problems through : 
Sure on their 


Ne the host 
country, " m 
The Cautious are neither enthusiastic nor 
indifferent. 


They weigh altern 


atives carefully, 
work through th 


and Slowly eir Problems, Tf 
they are not able to cope with their Problems, 
they look for a More congenial environment, 

The Supporters are responsive to direction 
from others, They are supportive of Others, 
although they do not take active leadership, 
Tf placed in highly Structured situations, they 
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m | 
il they lea 0 
will have adjustment problems until they 


are in W 
the cultural patterns; if they emit dis- 
Structured situations, they will 
organized by their environment. "e. 

The Fumblers are noncoping, a elf direct® 
nonsupportive. They are aue ie c 
nor responsive to direction from d to the Jo 
are a burden to their husbands des iners, 00 
community, are chronic comp E they 47 
blame others for their problems. | roble 
ultimately unable to cope with ge S 
their husbands may be asked to res San 0 

Useem emphasized that these P fin bast 
behavior would not provide an Y overse?” 
for the selection of personnel d of à P 
assignment, Since, with the jexcepti a life PE 
Fumblers, for whom fumbling ae ka cons 
tern, these behavior patterns were yoman V gi 
tent over periods of time. An older 3 lifetime! 
had been a Cautious type during apre cha 
the United States, for instance, me could P^ 
into a Coper in India. Also a womar could b^ 
a Coper during one assignment, Lie assig” 
come a Cautious type during the a w n 
ment, and all could become Fumbler$ » 


eca 
placed under a great deal of stress. PP. vi 
Useem found little evidence that P5 e foun 
Social behavior were long enduring, behavior 
them to be of little value in predicting rs om 
Smith (1956) reported that Anja at" 
Similar Cultures tend to develop comm e oU 
terns of beh er? 
Served that 


and 


m$, 


"ODing countries when pe n "n 
to the Uniteq States, because they 0 jns E vi 
i Status during their wem n d 
example, foreign Scholars who per P 


sence 9 
general Superiority of American scienc p 
technology MAY’ feel a loss: of status ig s 
Studying in the United States (Be! 9803 
Humphrey, 1957; Lambert & Bressler, 5) 
Scott, 1956) Lambert and Bressler ign 
scribed the 


ort 
Sensitive areas of the 
Student in the Uni 


adjust" 
hited States and the a 50) 
ment Problems t} 


nat may result, Smith 6 te? 
reported that ndian ‘students rer i 
*ppalled by Americans, because of the la? ie 
Pparent ignorance and disapproval ste” 
ndian culture, for instance, the caste p to 
and Poverty, T] e Indian students tend jiat® 
fact in a hostile 


manner and to retal 
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agai: 
E t Americans by referring to major 
Problem pri. for instance, the racial 
panese = — to the Indian students, 
asm of di ents appeared to react to criti- 
“enryo 3 tape country by employing 
Whether That is, withdrawal from Americans. 
Manner eith student reacted in a defensive 
AG er counterattacking as the Indians 


Or wi 
ith e 
drawing as the Japanese, seemed to be 
Those 


'5 that is, in order to maint 
* has to deny his negative attitudes 
Journey? a defense mechanism. Thus, the 
Oink, - negative attitude toward the host 
îttitude may be a function of his negative 
ae ne his own country and of his 
urn} eness, rather than of his actual so- 
e ng experience. 
Scripenetts Passin, and McKnight (1958) de- 
Stude the adjustment patterns of Japanese 
“ents in the United States, based upor 


ud 
tests derived from interviews and a battery © 
Wop, Ihe adjustment patterns of the students 


es escribed in relation to three personality 
and 4 namely, the Adjuster, the Constrictor, 
lo he Idealist, The Adjuster is able to adapt 
in 2m the Japanese and American ame 
tien to assimilate attitudes from both 
ig es. The Constrictor is conservative 1N 
With, titudes, and his behavior is congruent 
Is libe rthodox Japanese culture. The iei 
tura ral in his attitudes and is open to Cu- 
Si je, enge. In another study of TS 
ree ners, Nash and Shaw (1963) descr ec 
o a Personality types of Japanese immigrants 
VORRN which he termed the Autonomous, 
" ional, and Transitional. 
Shi, *rlmutter (1954) investigate t 
Ment etween the American sojourner’s adjust- 
for ges xenophilia, that is, the preference 
ag, ngs and persons foreign. Sojourners were 
p, stered ü scale consisting of sentences 
"he j; eme foreign and domestic objects. 
ers ae were defined as those sojour- 
Ome ho chose the foreign in preference to the 
Stic object, for instance, “Most European 


d the relation- 
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girls make better wives than American girls." 
In spite of the Xenophile's preference for things 
foreign, he tends to isolate himself in American 
enclaves while living in the host country. 
Perlmutter suggested that the Xenophile tends 
to fantasize about "a better world abroad," 
regardless of where he may live. It appears 
likely that the Xenophile is a dissatisfied person 
who is trying to maintain his self-esteem by 
holding his environment responsible for his 
feelings of dissatisfaction. Pool (1965) has pre- 
sented a more extensive examination of the 


significance of xenophilia in understanding the 


sojourner. 

Lundstedt (1963) stressed the importance of 
tors in sojourn research. He 
a closed mind and ethno- 
he attainment of effective 
overseas adjustment, and that the more effec- 
tive coping mechanisms in adjustment consist 
of rational attitudes, universalistic tendencies, 
ss, and flexibility. A similar 
viewpoint was taken by Gardner (1962). He 
suggested that the “universal communicator” 
will have the least difficulty in adjusting to 
another country. The universal communicator 
was described as having a well-integrated 
personality, a central organization of an ex- 
traverted type, a value system that includes 
the “values of all,” a socialization of cultural 
universals, and a high degree of sensitivity 
toward others. i 

'The descriptions of the potentially good 
adjuster, portrayed by Lundstedt and Gardner, 
appear to be commonly accepted in the litera- 
However, the description 
is based primarily upon face validity rather 
than empirical data. In spite of its appeal to 
common sense, the description may be over- 
simplified and possibly incorrect. For example, 
Bronfenbrenner, Harding, and Gallwey (1958) 
found evidence that general interpersonal 
sensitivity is associated with passivity and 
withdrawal; hence, extending their findings to 
intercultural interactions, the sojourner who 
has a high degree of sensitivity may be at a 
disadvantage in adjusting to a foreign culture. 
Also, Byrnes (1966) described a number of 
studies about overseas American technicians 
and reported that the evidence did not support 
an “overseas type,” that is, the successful 
overseas technicians did not show any consis- 


personality fac 
maintained that 
centrism may limit tl 


open-mindedne 


ture on sojourning. 
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tent personality patterns that would distinguish 
them from unsuccessful technicians, Addition- 
ally, in a study by Smith (1966) dealing with 


tern or with administrative evaluations of 
volunteers in the host country. 
The conceptualization of 
the sojourner in 
Social patterns, 
characterizations, 
interesting, entert; 
dotal, H. 


the adjustment of 
terms of typica] behavior or 
and Personality typologies, 
traits, and Constellations js 
aining, and Somewhat anec- 


lucid 


tinctions between them; the Particular pat. 
terns, typologies, and t 

varied as the investigators them: 
Some authors see i 
Particular beh 
adjusting whi] 


as 
selves, Also, 
nd emphasize 
or modes of 
antecedent, back- 
or social determinants, 


to distinguish 
and Consequents in re] 
terns, typologies, and traits 
being a lack of clarity, Most of 
ave an impressionistic 


establishing the Social patterns 
typologies of soj i 


approach in 
and personality 
out empirically 


er clearly 
alized in any comprehen- 
sive manner, Although it seems unlikely, these 
descriptions may have some futur 


© value in 
understanding intercultural adjustment, but 
at the present time they are little more than 
h 


Ypotheses based on limited data 


and genera] 
Impressions, 


BACKGROUND AND SITUATIONAL FACTORS 

A somewhat Sreater degree of success in 
predicting intercultura] adjustment has been 
based on background and situational factors, 
Smith et al. (1963) reported that the religious 
affiliation of Peace Corps Volunteers was 
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How- [ 
1 orale. 
related to adjustment, that Is, mor 


ol, an 

ever, age, type of undergraduate E. a- 
previous teaching experience Shom tigation o 
tionship to morale. Another d that paternà 
Peace Corps Volunteers found id signi- 
absence during childhood peg ul 
ficantly between successful and 2 proportion 
volunteers (Suddenfeld, 1967). The at sign” 
of volunteers from fatherless homes intet 
ficantly greater among unsuccessfu xorale We 

Smith et al. also reported that Titi in 
higher for Peace Corps Volunteers ing in mo 
near the major cities than those "€ H 
rural areas within the host country. an scho?» 
morale was not related to age of a or work 
subject matter taught, teaching seat an 
ing alone. In another study, Gu 1[-satisfat 
Gullahorn (1963), using a 13-item adjustment 
tion questionnaire to measure Guest ee 
found that American grantees in ved ia 
better adjusted than those who sta) 
Near or Middle East. 
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USTM 
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" lal hitai © er 
The degree of Social IBÉGraqtiOn. sojou? 5 
between the host national and the er 


> Ja 
to the 1a% on 
has been found to be related to tl e 
adjustment, Fo 


avids 
r example, Sewell and Da V^ i 
(1961) 


c 
r cont 


A psychiatri 
in Columbi ore 
interaction result in adjustment Pfcst 
(León, 1963). The Women patients pue) 
rotic symptoms of Én sub 
a, heavy drinking, etc., and Le * nav 
gested that a woman sojourner ted i 
greater difficulties than a man in dap Jes? 

i Y because the woman ibe pro 

Y to interact with others. T m hi? 
€ssional man can derive satisfaction V con” 
Se in social interaction i! ewif 

is job, Whereas the hous ia 


Junction with 


e 

3 ew avenues of Social contact OPO, 3) 
her So Gullahorn anq Gullahorn i jgh 
reported that American Sra 


Dtees who ha 
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adju 
mith he = Scores tended to maintain contact 
Matters a consult on professional 
Sst country Colleagues and students in the 
Stants in Y, and help these hosts to apply for 
M the United States. 
the ee sections of this article described 
Severa] er between adjustment and 
Personali tables, that is, patterns of behavior, 
lors, Th d background, and situational fac- 
Variables Se sections that follow, some of these 
` are seen to be related to adjustment 
= they either facilitate or inhibit 
ewel] eraction. 
Cree o and Davidsen (1956) determined the 
Students Social interaction that Scandinavian 
Culture, p had with aspects of American 
9f the st he measure of social contact consisted 
With tudents? self-reports of social interaction 
tommers ect to American homes, churches, 
än Genk recreation areas, campus activities, 
Stound "s within the United States. Back- 
m *racti actors found to be related to social 
M the ion included high socioeconomic status 
Student's home country, urban back- 
oe contact with Americans, and 
So] ge ‘ants as opposed to a physical science 
ai background. 
Sojo = (Sce Footnote 2) stated that the 
ipee S geographical background is an 
He ee factor influencing social interaction. 
Student that European in contrast to Asian 
$, while sojourning in the United 


tat 
eS, ha r ie s E 
Ame.” had more social interactions with 
students in 


eric, 
Eur Cans, Similarly, American 
hose hate ported more social interactions with 
Ewel] onals than did Americans in Asia. 
tha i and Davidsen reported other factors 
of org enced the amount of social interaction 
Student Students in the United States. Those 
Feauent Interacted with Americans most 
United S whose purposes for coming to the 
an tates were social or cultural rather 
h professional. Also, those students who 
f pi aba less frequent and less severe 
Bage Ing intercultural experiences tended to 

Sts, more in social interactions with the 
anq Wi another study, Goldsen, Suchman, 
i em Bis (1956) investigated the social 
dg ction of Americans on a foreign campus, 

Ost Bn that voe xe who participated 
Dite," the campus ife revealed a general 

rn of friendliness, and were less critical 


an 
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of specific issues and practices of the United 
States. However, the degree of social interac- 
tion was not related to the social, political, or 
economic interests and attitudes of the stu- 
dents. Kelman (1962) suggested that inter- 
cultural interaction may be increased by the 
sojourner’s participation in an ongoing enter- 
prise, maintaining the self-esteem of the host, 
and by introducing common goals as the result 
of combining his effort with the hosts on a par- 
ticular task. Finally, the results of several 
studies (e.g, Morris, 1960; Selltiz, Christ, 
Havel, & Cook, 1963) have indicated a posi- 
tive relationship between one’s fluency in the 
host language and the extent of social inter- 
interaction with the hosts. 

The studies cited in the previous sections 
indicate that social interaction is a relevant 
variable in understanding the adjustment of 
the sojourner. More specifically, it seems that 
various background and situational factors 
influence the sojourner’s social interaction with 
his hosts, which in turn influence his adjust- 
ment. However, the term social interaction has 
not been clearly specified by the majority of 
investigators with the result that the concept 
remains somewhat vague and elusive. For ex- 
ample, does social interaction refer to talking 
to host nationals? To the number of people the 
sojourner talks to? To the amount or type of 
talking? Does it refer to the situation in which 
he is merely present with others without actual 
conversation going on? Does it refer to the 
number of hosts that the sojourner meets, 
knows, or is close friends with? Thus, there are 
many ways in which a sojourner can interact 
with his hosts, and it would seem necessary to 
specify more clearly what is meant by social 
interaction in order to be able to relate dimen- 
sions of social interaction to the sojourner’s 


adjustment. 
INTERCULTURAL COMMUNICATION 
The study of the adjustment of the sojourner 
should not emphasize the social interacticn or 
contact experience per Se, but rather the dimen- 
sions of interpersonal processes that occur 
during these encounters. What is most impor- 
tant, then, is not the mere occurrence of the 
contact Or interaction, but instead, what 
actually happens during the encounter. 
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di - 
fo Men The subtle and yet very 
not be raion erences: between cultures may 
may be derstood, with the result that what 
"most eere by the sojourner as the 
ceived b 2 E of intentions,” may be per 
insult la e host-country nationals as 2n 
etween 2e An awareness of differences 
crucial Em tures. would seem, then, to be 
Cation, effective intercultural communi- 
very dim wi developing such awareness 1$ 
ties of en t, primarily because of the subtle- 
Stated mie differences. As Kluckhohn (1949) 
do ti en he compared a culture to a map, 
Won't Rs E accurate and you can read it, you 
now yo SES if you know a culture, you will 
b. 28] uir way around in the life of a society 
culture j Ina recent study of the Sinasina 
t € in New Guinea, Turner (1968) stressed 


Mat; : 

“cult à sojourner should understand the 

EV theme” of the host culture in 
to establish adequate intercultural 


munication. 
chuetz (1944) described the role of a 


Tanger i $ " 
mo di In a group, in which the stranger must 
y his frame of reference in order to 


e 
es aware of the implicit structure 
‘ly ken S the group. The stranger cannot 
Structure 22 notions about the norms and 
Cebteq Tit the group if he is to become ac- 
tan ie ikewise, the sojourner in his role as a 
aware yt to a foreign culture needs to become 
More di ts implicit rules. This is no doubt a 
ing y ifficult task for him than were he adjust- 
Ba à new group within his own culture. 
ie (1959) and Hall and Whyte (1963) dis- 
Cation abs process of intercultural communi- 
dune stated that periods of time spent in 
of the * culture may bring to awareness aspects 
Previo ojourner’s home culture of which he was 
aware usly unaware. A sojourner becomes 
emplo of the many subtle cues that he has been 
i ying within his own country, because 
iue are no longer effective for communi- 
n within the host culture. An awareness 
Eos Mai formerly unconscious cues for 
pmmication, Ms be an important con- 
ition to effective communication within 
ns host culture, because the awareness of a 
may enable a Sojourner to modify his 
Patterns of communication so that fer ante 
Congruent with the cues of the people SR the 
OSt country- Therefore, if a sojourner can 
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consciously direct his behavior, the likelihood 
of his becoming better adjusted would be in- 
creased. In terms of the downward slopes and 
low points of the W curve, adjustment would 
be evidenced by a reduction in the slope and a 
reduction in the severity of concomitant 
culture-shock symptoms. 

Intercultural communication can 
ceived in terms of verbal communication, non- 
verbal communication, and complex inter- 
relations between the two. The most obvious 
mode of communication is through the use of 
language, and its importance to understanding 
adjustment has been indicated by a number of 
studies. 

Deutsch and Won (1963), for example, re- 
ported that language facility of trainees of the 
Agency for International Development was à 
factor influencing the degree of their social 
satisfaction or adjustment. And in a study of 
Scandinavian students in the United States, 
Sewell and Davidsen (1956) reported, also, that 
fluency in the host language was à necessary 
condition for a satisfactory adjustment. How- 
ever, Major (see Footnote 2) reported that the 
English fluency of foreign students in the 
United States had little bearing on either their 
social or academic adjustment. Major suggested 
that the foreign student's confidence in his 
use of the host language had a greater influence 
on his adjustment than actual facility in the 
host language. Gullahorn and Gullahorn (1966) 
suggested that confidence in the use of lan- 
guage, regardless of ability, may be a function 
of a more general confidence in one's ability to 
interact with others. Within certain cultural 
contexts the sojourner who has confidence in 
his ability to speak may be encouraged by 
foreign nationals whenever he uses the host 
ver, Gullahorn and Gullahorn 
all countries are equally en- 


be con- 


language. Howe 
noted that not 


couraging of the foreigner who attempts to 
speak in the host language. For example, 


whereas a sojourner to the United States will 
be likely to find his attempts to speak English 
generally encouraged, in France the sojourner 
may find his hosts relatively intolerant of 
misuse of the French language. The sojourner 
who has strong tendencies toward verbal 
interaction may find such rebufis considerably 
frustrating. Other aspects of language have 
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been covered by Bennett et al. (1958), Feld- 
man (1968), and Watson and Lippitt (1955). 
Nonlinguistic as opposed to linguistic aspects 
of culture have been emphasized by Glenn 
(1967), Loubert,* and Wight, Hammons, 
Bing? in explaining the 
cation that often exist 
host. And Hall (1959) 
culture in which verbal 
one of many possible m 
He formulated a “ma 
be represented by a 


and 
barriers to communi- 
between sojourner and 
Proposed a theory of 
communication is only 
odes of communication, 
P of culture” which can 


10X10 matrix of cells. 
While the axes of the matrix consist of *pri. 
mary message systems" or Systems of communi- 
cation, the cells in the matrix include the 
totality of activities in any culture, Thus, Hall 
conceived of culture as à vast system of com. 
munication composed of these message Systems 
into which all of the aspects, activities, and 
elements of a culture may be categorized. Each 
cell has associated with it a “meaning” or 
communication value that is generally under- 
stood by the majority of members of a given 


Culture. Since only one cell of the matrix is 
Used to represent the Process of verbal commu- 
nication, all of the 


remaining cells pertain to 
messages that occur nonverbally and largely 
Out of awareness. "Therefore, nonverbal, 
comparison with verbal Communication, would 
seem to be of much greater significance within 
the culture (see Duncan, 1969, for a recent 
review of nonverbal behaviors). 

Hall contends that the greatest confusion in 
intercultural communication can be traced to 
a failure to interpret correctly the subtle cues 
that precede and surround a verbal communi- 
cation, A person may understand the actual 
spoken message, but may be totally unaware 
of the nonverbal messages that are occurring 
on the unconscious level. This unconscious com- 
munication is described as following certain 
culturally determined and relatively inflexible 
“rules,” and may be discovered through analy- 
Ses of nonverbal behavioral patterns. Often 
the Unconscious behaviors become known to 


‘ Loubert, J. D. The trans-cultural res 
ing institute (TCI). (ONR Tech. Rep. No. HSR-RR- 
67/7-Cs, — Contract Nonr-4346-(00), NR-177-238) 
McLean, Va.: Human Sciences Research, 1967. 

^ Wight, A. RS Hammons, M. A., & Bing, J. Cross- 
cultural and community involvement training. (Peace 
Corps Tech. Rep., Contract PC-25-1710) Estes Park, 
Colorado: Center lor Research and Education, 1969, 
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à tively 
the sojourner during the period of rn 
greater adjustment difficulties. In n were 
ner’s home country many actions wet are 
automatic and primarily Lange’, cul- 
relatively inappropriate within the ocess 0 
ture and become conscious in the Pring t 
adjustment (see David, 1971). aroo has a 
Hall and Whyte (1963) sojourning ELE. 
"inoculation? effect, An American, tet erienc& 
by virtue of his previous travel modes 
becomes more aware of his own n nsciou 
behavior, which were previously unco aware- 
to him in the United States. This poi that 
ness helps, then, to reduce the impen quent 
interfere with understanding in su 
intercultural encounters. 

Apparently a variety of differences tanding 
cultures may contribute to misunders a f- 
in the communication process. Gunan vo 
ferences with respect to time and space ces and 
examples. Important cultural differen! mun 
their implications for intercultural co! le 


hat pe? 
cation may be found in the way tha 
from vari 


ous countries conceive time: if om 
scribed by Cleveland et al, (1960), nets 
American is invited to a seven o'clock. n Je 
he will Probably arrive between >) a will 
minutes after Seven. A Scandinavia! it 1 
probably arrive exactly at seven, af , may 
likely that he will bring a gift. A Lau may 
arrive at nine o'clock, and an Ethiopia "ot 


i ay 
arrive somewhat later, A Javanese m2 


e 
A z acce 

ugh he had politely ien Pom 

M order to prevent the hos 


to wait 45 minutes ulteds 
nt will likely be ae a 
atin American co a un- 
would be considere 


:Geanct 
al communicative significa" r 
avior and its implications, 
Intercultural communication, in par 1089) 
ave been discussed by Brein ^ and ca velit 
1966, 1968). When two persons from dif dings 
cultures interact, various misunderstan 
may result from each interactant's wo 
insight into the Other's use of T | dif- 
Understandings can arise from crosscultura class 
erences in territoriality. The middle-c'a 
* Brein, M. 


Psychoproxemies, Qj 
Script, Universi 


anu- 
npublished m 
ty of Hawaii, 1970, 
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Ameri 
an vides pud believes that he has 
is own, wh uet tto an office, room, or desk of 
not have oo his English counterpart does 
Such territ € same apparent need to establish 
o Os Staten Also, the American tends 
or instance AINE rights in public places, 
another per. : he is sitting at a public counter, 
is m is not expected to attempt to 
According t hs: his seat; however, the Arab, 
really is s Hall, believes that a public area 
ave files blie, and that a person does not 
cause he o to an area or location simply 
number of occupies 1t. In such encounters, any 
potential misinterpretations of be- 


avio 

Toc 

Die could occur between Arabs and 
€rlcans, 
he 

th conversational interaction distances 


üt pe 
c pw maintain between themselves have 
atin Fw to vary across cultures. Arabs, 
Sroups imericans, Indonesian, and African 
Thaintainine = others, have been described as 
ances , ing generally closer interaction dis- 
(Argyle & De themselves than Americans do 
all& wA 1965; Hall, 1959, 1963, 1966; 
Cultures ; yte, 1963). Hall suggested that since 
Mteracti apparently vary In their norms for 
feti + distance, people from these cultures 
and may their spatial experiences differently 
Ings a not necessarily attach the same mean- 
eir their space usages. Hall emphasized 
. Implications these differences might have 
y Tespect to intercultural communication 
108 in an example of the misunder- 
Cultural that could result from the different 
) hens Lees attached to the use of space. 
hvers Latin American and a North American 
Conver, €, both try to maintain their normal 
sation distances with 


Wi 


th 
€ res 
esult « move close, 


"€ with . . that when they [Latins] r 

think hdraw and back away. As a consequence they 

We. 2n € are distant or cold, withdrawn and unfriendly. 

reat $ the other hand, are constantly accusing them of 

Our f, ing down our necks, crowding us, and spraying 
aces [1959, p. 164]. 

Al P 
np ing Hall's observations are largely 
end sionistic, some experimental evidence 
action to support his views regarding inter- 
m distances (Brein, see Footnote 6; 

iis & Larson, 1968; Sommer, 1967; 
s son & Graves, 1966). Other investigators 
Ee studied crosscultural differences in 

atia i i 3 

tial behavior by using object arrangement 
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procedures. Based on the methods of Kuethe 
(1962a, 1962b, 1964), subjects arranged pairs 
of dolls or paper human cutouts into simulated 
conversational interactions in any manner 
they wished onto backgrounds or fields. The 
distances resulting between these figures have 
been found to correspond to the actual dis- 
tances that persons maintain between them- 
selves during conversational encounters. Little 
(1965, 1968) termed these spatial separations 
“reproduced interaction distances,” and ob- 
served that Mediterranean groups (Greeks and 
Italians), in comparison with Northern Euro- 
pean groups (Swedes and Scots), placed dolls 
significantly closer together during simulated 
“conversations.” Engebretson,* in a similar 
study reported cultural differences between 
Americans and Japanese. Tt seems, then, that 
such object-arrangement tasks may be po- 
tentially valuable for studying interaction dis- 
tances in relation to the processes of intercul- 
tural communication. 

Triandis, Vassiliou, and Nassiakou (1968), 
in a cross-cultural study of social perception, 
suggested that two persons from different cul- 
tures interacting with each other could develop 
misunderstandings and misinterpretations of 
each other's behaviors as a result of differences 
in their perceptions of social behaviors. It is 
possible that they could perceive their shared 
encounter “in very different terms” with re- 
spect to the exchange of reinforcements 
occurring between them, the degree of intimacy 
of the encounter, as well as their relative dif- 
ferences in status. For example, 


from one culture may provide what he con- 
be friendly criticism to à person from another 
o discover that the other person interprets 
a person from culture A behaves in 
he considers extremely “positive” 
toward a person from culture B. However, the indi- 
vidual from culture B perceives the behavior as “neu- 
tral,” and in turn, the individual from culture À feels 
that he is "given the cold shoulder” Ep. 33]. 


a person 
siders to 
culture only t 
it as “hatred.” Or, 
a manner which 


Triandis et al. also gave à specific example of 
Greek and American differences in the percep- 
tion of certain kinds of social behaviors, and 


discussed the implications of these differences 


D. E. Cross cultural variations in 
territoriality: A base-line determination of interactional 
distance between shared culture dyads. Unpublished 
doctoral dissertation, University of Hawaii, 1969. 


7 Engebretson, 
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for intercultural interactions between Ameri- 
cans and Greeks, 


Fayerweather (1959) described how Ameri- 
cans feel complimented when they are told that 


they are open, direct, and approachable; 


however, Mexicans are Seen to regard openness 
to be a form of weakness or treachery. The 


Mexican can give in or humiliate himself, but 
he should never allow the outside world to 
penetrate his intimate being. The Person who is 
open is not to be trusted. He is a traitor or a 
man of doubtful fidelity, one who tells secrets 
and is incapable of facing dangers as they 
should be faced. In an article about the first 
Peace Corps Volunteers to the Philippines, 
Guthrie (1966) described how the volunteers 


embers of the staff to 
in discussions with the 
a devastating Piece of 


ationships 
ecause Philippine social 
d upon “smooth inter- 
ps.” Szanton (1966) de- 
€rpersonal relationships as 
Potential disagreements by 
ignoring or denying them, or calling on a third 
party to negotiate a compromise. Adams (1962) 
discussed some of the differences between value 


Koreans and Americans and 


According to Cleveland et a]. ( 


1960), the 
American sojourner may h 


ave difficulty adapt- 
ndirection and innu- 
a way of life. For example, in some 
Asian languages the word for “no” js rarely 
used or is the same word as “yes”; thus, the 
Oriental “ves? may mean “maybe” or “no.” 
The Oriental may consider it better to say 
untruth than to cause another person to | 
face. Cleveland et al. presented a story 
àn Indonesian middle-class bo 
marry an upper-class girl. 
Went to the house of the gii 
banana was served wi 
most unusual combination. 
not discuss the marri 
but the boy's mother 
was unaccept 


an 
ose 
about 
y who wanted to 
The boy’s mother 
l’s mother for tea, 
ith the tea, which was a 
The women did 
age during their isit, 
knew that the marri ge 
able —bananas do not go with te. 


a. 
The relevant information had been communi- 
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er, an 
cated, but nobody had lost face. Hi dips 
American might have picked up TO the 
and started to peel it while discussing 
wedding date. tural mis- 
The preceding examples of intercu tment 0 
understandings suggest that the um which 
the sojourner is related to those mo the 
differences occur between cultures. of the 
basis for ineffective  urxparae. in the 
sojourner would seem to be embed Seal cul- 
differences between the home and sations 0 
tures; consequently, further investig? able t 
cross-cultural differences should be “al com- 
establish the areas in which ieee e j- 
munication may be improved. Mord cutie 
cally, to understand the adjustment veces 
of an individual sojourner it would be D person- 
to relate specific background we con 
ality traits, and situational factors io n" 
munication process. Relating these fac rovid 
intercultural communication may iren Hi any 
an integrating frame of reference by W w e 
number of independent approaches CO to the 
compared and understood in relation the ef- 
adjustment of the Sojourner. In beer allow 
fective exchange of information wide n F 
for improved intercultural encounter orth 
velop, resulting in improved adjustment jt ral 
Sojourner, Although effective intere d 
communication seems to establish the nece that 
basis for adjustment, it does not meong 
the sojourner will show good adjust? ^. 
Sojourner may know how to commun” 6 
effectively in the host culture, but he ies dif- 
be able to either accept or reconcile -— 1 
ferences between Cultures. Although ! 
beyond the Scope o. 


in understanding ce 9 
Sojourner may be related to his acceptan 
the host culture once he has developed ef 
patterns of comm unication. 


adjustment © 


fective 
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involving risk is provide: 
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should begin with a close exam 
data concerning the phenomenon in question. 
In particular, two characteristics of the risky- 
shift phenomenon have not received adequate 
attention. First, while the generality of the 
risky shift has been documented, its limits are 
rarely acknowledged. Among the 12 hypothet- 
ical choice dilemmas in the questionnaire 
used in the original risky-shift study by 
Stoner’, there are two items (5 and 12) that 
y yield group shifts toward greater 
atism), though the magnitudes 
of these shifts are at best moderate (i.e., a third 
to a half of the mean risky shift). Other hypo- 
thetical choice-dilemma items that yield cau- 
tious shifts have been constructed by various 
researchers (see Burns'; Myers’; Nordhóy*; 


consistentl 
caution (conserv: 


3 Stoner, J. A. F. A comparison of individual and 
cisions involving risk. Unpublished master's 
thesis, School of Industrial Management, Massachu 
setts Institute of Technology, 1961, db he 
4 Burns, J. F. An extremity-variance 3 
a 3 F 5 analysis 
group de involving risk. Unpublished pr 
dissertation, M assachusetts Institute ^ 
1807. , titute of Technology, 
5 Myers, D. G. Enhancement of initial risk taking 


group de! 
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Rabow, Fowler, Bradford, Hofeller, & Shibuya, 
1966; Stoner, 1968; Vidmar & Burdeny, 1969). 

Another important limitation on the general- 
ity of risky-shift phenomenon is its failure to 
Show up in studies of betting behavior (Col- 
lins & Guetzkow, 1964; Hunt & R 


owe’; 
Lonergan & McClintock, 1961). Similarly, the 
risky shift h 


as not appeared consistently in 
Estes-type gambling situations where Subjects 
"bet" which of two lights would appear. Two 
Such studies found cautious shifts (Smith & 
Radinsky*; Zajonc, Wolosin, Wolosin, & Sher- 
man, 1968): a third obtained both cautious 
and risky shifts under different conditions 
(Zajonc, Wolosin, Wolosin, & Sherman, 1969), 


1969), the 
riskiness, 


success, 
- The second study 


also important to not 
studies demonstrating the risky 
defined only in terms of the od, 
(i.e., Probability) of an alternat 
action, The smaller 
riskier the d, 
the risky 


€ that in 
Shift, risk is 
ds of success 
ive course of 
the odds of Success, the 
€cision is considered to be, Hence, 
-shift phenon an appar- 
gei e for smaller odds of suc- 
cess. This, h and should not be 
of risk-taking be- 


havior, In Some studies, risk taking has also 


tendencies in social situations. Unpublished 
dissertation, University of Iowa, 1967. 

s Nordhgy, F. Group interacti 
under risk, Unpublished master’ 
dustrial Management, Massa: 
nology, 1962, 

T Hunt, E. B., and Rowe, R. R. Group and individual 
economic decision making in risk conditions, In D, W, 
Taylor (Ed.), “xperiments on decision making and other 
studies. (Tech, Rep. No. 6, AD 253952) Arlington, 
Va.: Armed Services Technical Information Agency, 
1960. 


doctoral 


ion in decision-making 
5 thesis, Schoo] of In- 


chusetts Institute of Tech- 


7 and Radinsky, T. L. The effect of 
diffusion and concentration of responsibility on the 


omenon in 4 two-choice situation. 
Paper presented at the Spring meeting of the Iowa 
Academy of Science, Iowa City, April 1969, 


ive with 
been defined as choice of an iege vale 
greater variance of probabilities ics choice 
(Coombs & Pruitt, 1960, 1961) or certainty 
of an alternative with greater ve (Marquis 
about these probabilities and values ming the 
& Reitz; 1969); Rar reviews emer ei: be- 
definition and assessment of cet. an 
havior, see Slovic (1964) and Co 1 

uang.? ry 0 
» If we are to develop an adequate er. 
individual and group decisions orte o 
certainty, it should include an exp fi ould 2N 
the decision-making process and sh e shifts 
count for both risky and Conserva Anite 
It should also go beyond the narrow licable t° 
of risk so far employed, and be APP a poses 
greater variety of tasks. The rudimer organiza- 
a theory emerge from the following org 
tion and review of the literature. 

The various explanations that have 
been offered to account for the risky ai 
be organized according to the onam 
sumed to underlie the effect. Four ty el 
pothetical processes can be distinguis tistical 
fective, Cognitive, interactive, and sta 


e 50 far 
t can 
g 05 
f hy- 


a 


ArFECTIVE HYPOTHESES duce 
rodu 
. re r 1 roS 
Making a decision in a group xt in 
positive or negative consequences fo into 
dividual. If he 


xquences 
takes these consequer 
account, and they 


affective Process is 
other words, if wh 


a an 
E + vision, 
influence his deci I 


said to be mer 
at the individual pam 
gain by virtue of the fact that he is ma under 
decision in a particular context ut 
group pressure, in the presence of ion) in 
under pressure to reach a quick iv Rum i 
fluences his decision, the process of influ termi- 
an affective one, Borrowing from the jwar 
nology of behavioral decision theory Gea 
Lindman, & Phillips, 1965), it can Dh from 
that affective processes involve a shi to an- 
one Payofl-probability decision matrix. the 
other, where the consequences of ep E 
decision ina group enter into wr 

Ithas been argued (Goodman)? thata cha 

LÀ Coombs, C. H 


zo theory’ 
+ and Huang, L. C. A portfolio th 
of risk Preference, 


J ichiga? 

(Rep. No. 68-5) Ann Arbor: Mic 

Mathematical Psychology Program, 1968. ituatio 
dl oodman, B. Group decision making in si 


^ i Iniverity of Michi- 
of risk, Unpublished manuscript, Univerity o! 
San, 1967. 


ns 


GROUP DECISIONS INVOLVING RISK 


i 2. 
Eu about risk level resulting from an 
Should mid is temporary, and therefore 
the grou em when the individual leaves 
E dE permanency ofa change 15 not, 
Processes sufh£jent evidence that affective 
has emi not operative. Once a change 
Stabilize d intervening factors may work to 
Ohe case 1 ommitment to the decision 1S just 
among ~ point. Similarly, the interaction 
fithera nese. members cannot be taken as 
an ea y or a sufficient condition that 
vidual ive process was operative. The in- 
Sequence es be affected by anticipated con- 
action E that are not mediated by the inter- 
Consequer much as he may recognize those 
coe but not let them influence his 
Altogeth pi he may fail to be aware of them 
her. 
dij, s byPotheses that fall in this category are 
tisk as ee responsibility, familiarization, and 
c oices), value (i.e., information about other's 


Dig; 
Wusion of Responsibility 


lem relevant to this hypothesis have been 
Ence " by Clark (1971). Though the evi- 
€restin TN against this hypothesis, it is in- 
that q g to note that the supporting evidence 
Opposite | exist is equally compatible with an 
j Cts ; S hypothesis. It can be argued that sub- 
he, willing to take greater risks in groups 
them Se of the potential prestige that will befall 
Word, if their choice leads to success. In other 
Cess il the emphasis here is on rewards for suc- 
e Ter than blame for failure. This might 
othe ed the "assuming of responsibility hy- 
ividuer According to this hypothesis, in- 
risks uals in groups will tend to take greater 
NS order (a) to demonstrate their willing- 
achie to assume responsibility and thereby 
äke Ms a position of leadership, and (b) to 
Wo jp for potential success. In tuitively the 
the p x potheses are equally plausible, and from 
Mun arch conducted so far it is impossible to 
guish them. 


P555: 
“miliarization 
5s pccording to this d poen when a subject 
tou. ssed to make a e he will take a cau- 
init position. Upon iis familiarization, his 
ie cautiousness VE . evaporate, allowing 
1 to endorse a decision involving greater 
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risk. Thus it is assumed that the process re- 
sponsible for the risky shift occurs before the 
individual is even exposed to the group or en- 
gaged in familiarization with the problem, 
that is, when he takes an initially rather cau- 
tious decision. Since the subject’s decision is in- 
fluenced by the circumstances under which the 
decision is being made, the underlying process 
is affective, although it also has some cognitive 
overtone. Since the bulk of the evidence is 
against the hypothesis, it suggests that the 
risky shift is not a pseudogroup effect (Clark, 
1971), but is actually mediated by some kind 
of group process. 


Risk-as-a-Value Hypothesis: Information about 
Others’ Choices 


This hypothesis has undergone several mod- 
ifications since first being advanced by Brown 
in 1965. In its general form, the hypothesis 
maintains that riskiness is valued in American 
culture. Hence, risk taking is socially desirable, 
and those who take risks conform to a cultural 
norm emanating from the cultural value of 
riskiness. Brown (1965) maintained that the 
consequences of the cultural value of risk in a 
group situation are twofold. First, the cultural 
value of risk biases the discussion and leads 
group members to raise more information and 
generate more arguments favoring the risky 
alternative. This biased flow of information was 
first noticed and reported by Nordhgy (see 
Footnote 6). Second, the cultural value of risk 
leads subjects to believe that they are some- 
what riskier than others and thus closer than 
the average person to their cultüral ideal. 
This was initially observed and reported by 
Hinds.! Each of these two consequences serves 
as a basis for a different version of the value hy- 
pothesis. Although usually ignored in discus- 
sions of the value hypothesis (e.g., Clark, 
1971), the distinction between these two ver- 
sions is of considerable importance since the 
assumed underlying processes are quite differ- 
ent. : : " 
The first version maintains that the biased 
of information and arguments favoring 


flow Ioni: 
alternative influence subjects to take 


the risky 

1 Hinds, W. C. Individual and group decisions in 
gambling situations. Unpublished master’s thesis 
School of Industrial Management, Massachusetts ine 
stitute of rechnology, 1962. 
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greater risks, They 
vinced on the basis 


, argues that during 
Who believe them- 
E n others are exposed 


up to what the cultural 
, they change their pri 


first hy 


Sion is being made, where Social con 
Drocess is the Specific 

1954), B 
fective hypothesis ( out others’ 
choices), believing that information about 
others’ cho mary cause of the 
risky shift, 


[Brown, 1965 p, 702]. 


In order to account for the conservative 
shifts found on Items 5 and 12 of the choice- 
dilemmas 


questionnaire ang o 
written by Nordhgy, Brown revised the hy- 
Pothesis, Thus it was hypothesized that any 


Ving risk elicits widely held 
alues that f; i i 


Most others, In the discussion, they 
it that others are in fact riskier than 
» and this will lead them to change and 
endorse Breater risk, The converse will be true 
for problems for which the widely held cultural 
values favor © Cautious decision. 

Most of the evidence that is generally cited 


in support of this version of the hypothesis is 
and ing i 


her ver- 
Port this hypothesis, as well aS. be o 
sion of the hypothesis, is the pro serall mean 
correlation between the tS risk across 
risk level, and the mean shift towar 


D 
i as questio! 
the 12 items of the choice-dilemmas q 


Footnote 
naire that was used by Stoner uh correla- 
in the original risk y-shift study. Kogan, ani 
tions were reported by Wallach, (1967) an 
Bem (1962), Teger and Proite 78, Sh 
Pruitt and Teger to be om peo reporte 
respectively. Another yere (b «00D 
by Stoner (1968) to be r — > items; inclu 
Stoner used à different set edi These 
ing 4 items that yielded cautious s ort for the 
Correlations are usually seen as supp: dicat 
dri 


odds). 


y are 
relations 
Unfortunately, what the ; ee Aine finds 
Supposed to indicate is not quite w. m For th 
upon closer examination of the data. : 


stal 

ita 

T $ : he inl” m 

items that yielded cautious shifts t e 
mean risk 


atv 

level is indeed on the ppt. 
Side of the Continuum, For the two cuestion" 
tive items of the rugis Giles aP E 
naire (5, 12; i.e., those that yielde 75 (see 
shifts), the average probability ted results 
Pruitt & Teger, Footnote 12, for poo! the ris ky 
based on six earlier studies producing used bY 
shift). For the four conservative e DB put; 
Stoner, the average is .76 (Stoner, dilemma 
for the 10 risky items of the morera risky 
questionnaire (.e., those that na (Pruitt 
shifts) the average probability is " only the 
& Teger, sec Footnote 12). Even D e., thos? 
four most risky items are taken G. rd. risk)» 
that produced the greatest shift a (1968 
this average is still .49, Only in ae risk-leve 
Study is the mean of the initia Nu 
Choices on the risky items somewha average 
risky side of the continuum, with an ort for 
of .42, These results provide little eit ye- 
the hypothesis, They make sense ar certi 
Spect to the Conservative shifts, bu 


set to- 
a shift a 
? Pruitt, D, G., and Teger, A. I. Is ee phe- 
Ward risk in group discussion? If so, is Peete at the 
nomenon? If So, what causes it? d ua logical Assoc! 
annua] meeting of the American P is dao 
ation, Washington, D. C., Septem! 


GROUP DECISIONS INVOLVING RISK 


Not f, 
© eo common risky shift on the 
he COS Lok ot cabins : 
is hypothesi ind of evidence in support of 
Son I Ma illuminates the social compari- 
Choices or i Before being exposed to the 
Jects eae ers during the discussion, sub- 
Nore risky eA themselves to be somewhat 
tisky shifts han most others on items yielding 
Yielding ca, and more conservative on items 
l; m E A shifts (Hinds, see Footnote 
"'allach & T Schneider, 1969; Stoner, 1968; 
Ng the proun ee” 1968; Willems, 1969). Dur- 
inaccuray UB discussion, the discovery of their 
Tespect t € assessment of how they stand with 
: ions others and the elicited cultural norm 
‘tems, cis them to take greater risk on risky 
» howey, smaller risk on cautious ones. There 
for 4 m MS an entirely different explanation 
The Pul that is considered later. 
Ort the hy kind of evidence advanced to sup- 
Vieweq ; hypothesis is that a high risk taker be 
Counter DIO favorably than his conservative 
and as on items that yielded risky shifts; 
lewed m IR the conservative risk taker be 
Batt on ore favorably than his risky counter- 
Madaras items that yielded cautious shifts. 
Ect a and Bem (1968) compared sub- 
risk teen of hypothetical risk accepters and 
i Nese ies on semantic differential scales. 
Era eoa figures were faced with di- 
naire rom the choice-dilemmas question- 
Items which included five 
ati o vield conserv- 
Yor the risky 
viewed more 
The risk ac- 


and other sources, 
slve Eos have been shown t 
‘tems th ifts. They found that f 
Vorably risk accepter was indeed 
Cente, y than the risk rejecter. ms 
Cessfu] Was rated as more strong, pij en 
What ? fast, hard, and masculine, an i p. 
teject more good and sociable, than the a 
"eese But on the cautious items, the es 
isk €r was not rated more favorably than me 
Still accepter. In fact, the risk accepter Was 
Hee ed more favorably than the risk re- 
Stat although the difference was much 
eae on most dimensions and nonexistent on 
n Ne. Similar evidence was produced by Lev- 
_ Ser and Schneider (1969). They found that 
2U5jects reported they admired choices riskier 
Mn their own, both on the 10 risky items of 
s choice-dilemmas questionnaire (statisti- 
Cally significant) and on the two conservative 
Ones (not statistically significant). These re- 
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sults provide some direct evidence for Brown’s 
risk-as-a-value hypothesis, that is, that on 
risky items, risk taking is socially desirable. 
However, they only suggest that the desir- 
ability of being perceived as à risk taker by 
others, or even by oneself, is what leads subjects 
to take greater risks in a group situation. 
These results failed to support the part of the 
hypothesis designed to account for conserva- 
tive shifts, namely, the contention that on the 
conservative items, cżution is prescribed by the 
cultural values and thus socially desirable. 

The results cited so far are in part compatible 
with other hypotheses. Therefore, the crucial 
evidence in support of Brown's hypothesis 
should be related to the mechanism underlying 
the shifts. That is, it must be shown that in- 
formation about others’ choices is a sufficient 
condition for shifts to occur. In a study con- 
ducted by Blank (1968), subjects made choices 
among gambling bets on tossing dice alone and 
in the presence of others also working on the 
same task. In the latter condition, subjects 
made their choices in turn but did not discuss 
them. The results supported the hypothesis 
ater preference for low-probability 
(ie, risky) bets when choices of others were 
known. In this study, however, the expected 
value of the low-probability bets was about 
three to four times greater than that of the 
high-probability bets, thus casting doubt on 
the comparability of its results to those of 
other studies in which expected values were 
usually constant. 

There is accumulating evidence indicating 
that information about others! choices is not a 
sufficient condition for shifts to occur; at least 
not for shifts of the same magnitude as the 
risky shift after group discussion on the choice- 
dilemmas questionnaire. Wallach and Kogan 
(1965) asked subjects to reveal their choices to 
each other on cards, change them if necessary, 
and reach a unanimous decision on the choice- 
dilemmas questionnaire. Subjects were not al- 
lowed to discuss the problems. No shift toward 
risk was found. Though shifts toward greater 
risk were produced in another study using the 
same procedure- with the exception of de- 
emphasizing the importance of reaching a 
unanimous decision -their magnitude was less 
than half of that found in the condition of dis- 
cussion to consensus (Teger & Pruitt, 1967). 


, 


showing gre: 
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In another study, 
sions on all 12 ite 
questionnaire and 


Subjects made initial deci- 
ms of the choice-dilemmas 
later discussed only half of 
them, thereby being forced to discover that in 
general they were relatively less risky than 
they had thought. A risky shift was found only 
on the items that had been discussed; there 
Were no shifts on the other items, The 
led the authors o 
“the crucial infor 
discussion is not 
information relevant to the sp 
cussed [Madaras 
similar experimental design 


Se results 
f the study to conclude that 


y Pruitt and Teger (1969). In that Study, the 
risk-taking task involved choices among bets, 
Agai 


a 
former, saying that 
some sort of communication among group members 
i is necessary for a risky- 
ation may even take the 
of exchanging information about pref- 


erences (Teger & Pruitt, 1967), though the effect is at 
its Strongest when a full gro 


up discussion takes place 
[Pruitt & Teger, 1969, p. 124]. 


abbreviated form 


In conclusion, in s 


pite of some indirect sup- 
portive evidence, Br 


own’s hypothesis that in- 
formation about others’ choices is sufficient to 


Produce shifts must be rejected, The evidence 
Suggests that the crucial factor responsible for 
both risky and Conservative shifts is s 
in the flow of informati 

being decided upon, 


edge of others? choices, 


Coenitive HYPOTHESES 

The cognitive 
overt change of 
Breater risk or 


hypotheses Suggest that the 
decision (i.e., shifts toward 
greater Conservatism) is the re- 
a covert cognitive opinion change, 
These hypotheses are concerned with altera. 
tions in decisions that result from changes in 
the cognitive view of the situation being evaly- 
an alterations resulting from di- 

ce of the circumstances under which 
cision ig being made (e.g., group discus- 
Presence of oth 
Borrowing 
havioral deci 
cognitive 


sion, 


rminology of be. 
ry (Edwards et al., 1965), 
hypotheses are concerned with 
changes within the Payoti-probability decision 


: iders 
matrix that the individual initially oett 
relevant for the given issue. It is still "cighing 
decision matrix that the individual e I$: the 
in the sense of containing the same fac tent 
relative desirabilities (utilities) of ton )) that 
outcomes and the likelihood Ca "e out- 
each course of action would lead to ce a re- 
comes. Thus, his decision may yeso relative 
sult of a change in his judgment of th 


: b- 
r their pre 
utilities of the outcomes and/or th 
abilities, 


n ole- 
. „mation Re 
Risk-as-a-Value Hypothesis: Informatio 
vant to the Task 


: revi- 
This hypothesis was considered in ep cau- 
ous section, Recall that the oa be re 
tious shifts required that the hypot E elicit 
formulated. To wit, any given proble r eithe! 
widely held cultural values that ae thus 
the risky or the cautious alternative, lirection- 
information flow will be biased in pa ài shifts 
However, the crux of the matter is t ee the! 
are produced because subjects ghapg grmi 
view of the situation by virtue of the ee is- 
tion and arguments being generated Be 1o 
cussion, rather than by mere Spon 
choices of others, In other words, b. “dis: 
Brown’s contention, the content of i “con 
cussion and the arguments “pro” and 


ible 
-esponst 
* primary factor resp 


e 


Because y 
value was 
about-other. 


a 
sale as © 
js Ser 

Most of the research on "rmation 

c a 

Seared to test the infor! 


m 
z t : forme 

S-choices hypothesis, the M 
tion-relevant-to-the-tash hypothesis & 
only indirect 


or 
eo indirect SUPP 
Support, This indirect sup! 
stems from fin 


the information-about-others choices ri 
esis. There is evidence that some sort en „ant 
munication in the form of information Me 

to the task is necessary for the full wi 
Shift to occur (Madaras & Bem, nee in 
& Teger, 1969). Also compatible with E 
wrmation-relevant.to. the. (acl. hypothe 

the evidence that subjects’ pers 
Strongly affected by Changes of infor -— $ 
task, as well as by n ý 
ng the discussion. Tinan duce 

al. (1966), changes intro the 
€ms resulted in changes hé: 
and dirctions of the shifts. Nore rgu- 
ally noticed the biased flow of arg 


Who origin 


rue 


GROUP DECISIONS INVOLVING RISK 


] I ior of riskiness, introduced two 
mas eten on Item 12 of the choice-dilem- 
tom a N that resulted in reversal 
town oa, to a risky shift (cited by 
Pile wh, 5). This item involves an engaged 
Counselor morae the advice of a marriage 
tied. A "ie out whether they should get mar- 
item Sie que argument in discussions of this 
à giae: to be persuasive: "TE they sought 
therefore r e advice they are not really in love; 

* odds ¢ à should not risk marriage unless 
are high ri epe married life in the future 
he cou il d hen additional facts were added— 
incidental WELG in love and met the counselor 
ris Was y at a party—a shift toward greater 
the choi observed. In discussions of Item 7 of 
SF A tg page questionnaire, where the 

Esenta, of the risky shift is greatest, the 
Suasive author has noticed that a particular per- 

his ite argument is often brought up- 
aying m involves a low-ranked chess player 
questio BERIISE a top-favored player. The 
tisky sis arises whether he should attempt a 
ictory ee that might bring him a quick 
cisions i ee were willing to make riskier de- 
ayer di lenever someone suggested that the 
odds of tempt the risky maneuver, even if the 
“other a TREES) BTE low, by arguing that 
Braue he would lose anyway, since as a 
aS the t ed player he is obviously not as good 
A op-favored player." 

M line of evidence fay hy 

exhibit | has already been mentioned. Subjects 

"ssio the risky shift after listening to real dis- 

individ, oF the choice-dilemmas questionnaire 

19 ually (Kogan & W allach, 1967b; Lamm, 

by ü ihe riskier group decisions are accepted 

rien FUE as their private decisions 
is x 1967; Wallach & Kogan, 1965), and 
hasi EE endures for at least G EM 

Perma, on"; Wallach et al., 1962) indicating the 

anence of the change that led to the shift. 

E summary, although all the evidence is at 

peers with the information-relevant- 

Subje "task hypothesis, the latter has not been 

est cted to direct test. l'or a rigorous direct 

Parti uf would be necessary to point out the 

cular information that produces the shifts. 


avoring the hy- 


t fata ^ 
et Johnston, La Jr: The immediate and short-term 
.*Cts of group discussion and individual study ou 


risk T 
isk-level, Unpublished doctoral dissertation, Florida 
late University, 
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Rationality: Expected Value and Subjective Ex- 
pected Utility Hypotheses 


The most elaborated formulation of the ra- 
tionality hypothesis was offered by Kogan and 
Wallach (1967a), though it was initially dis- 
cussed by Brown (1965). The hypothesis is 
based on behavioral decision theory (for re- 
views of behavioral decision theory, see Becker 
& McClintock, 1967; Edwards, 1961; Edwards 
et al., 1965). Briefly, behavioral decision theory 
is concerned with choices among alternative 
courses of action. The variables that are dealt 
with in the theory are the values and utilities 
(personal subjective value) of outcomes result- 
ing from alternative actions, and the probabil- 
ities and subjective probabilities that each 
course of action will lead to a certain outcome, 
One useful strategy is to choose an alternative 
that maximizes average gain or minimizes aver- 
age loss. The average gain or expected value 
(EV) of a given alternative action is a function 
of the probabilities (pi) that certain outcomes 
will follow the particular action, multiplied by 
the respective values (v;) attached to these out- 


comes, as follows: 


14; Dyeing OUTCOMES. 


n 
EV =} pas += 
i=l 
Similarly, when subjective average gain is taken 
to be the criterion for choosing an alternative 
action, it is determined by the multi- 
e subjective probabilities (spo) 
utilities (2) as follows: 


course of 
plication of th 
and the respective 
n 
SEU = X, Spitti, 
i=l 

where SEU is subjective expected utility. 

Basic to the rationality (expected value) hy- 
pothesis regarding the risky shift is the equa- 
tion of a rational decision with the choice of an 
alternative that maximizes expected value or 
subjective expected utility, and the assumption 
that risk taking represents a more rational 
maximizing approach in the choice dilemmas 
that yielded risky shifts, and vice versa for 
those that yielded conservative shifts. Once 
this assumption is made, the hypothesis can be 
stated as follow: \ group is in a better position 
than an individual to unearth information that 
will aid in the accurate assessment of the prob- 
abilities and utili es associated with the choice 
alternatives. This will result in shifts in the di- 


l 
| 
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rection required to maximize subjective ex- 
pected utility. It remains to be Shown that 
group decisions reflect better maximiza 
expected value or sub 
than individual decisio 
maximization of expe 


expected utility implies better utilization of the 
information already available to individual 


roup discussion; it does 


informa- 
) On the basis of two 
an and Wallach (1967a) argued 
. In one Study, expected 
value was increased for the more Conservative 
ach, & Kogan, 1965). 
expected value was the 
all alternatives (Wallach, Kogan, & 
o Clausen'4), Tn both cases, 
a risky shift was produced. In a more recent 
and Teger (1969), the risky 
Shift was also produced in a gambling task in- 
ves with the same expected 
values, Consequently, the rationality hypothe- 
cted if rationality is defined as 
Í expected value, 
Owever, these results may still be consist- 
€nt with the rationality hypothesis since it is 
possible that Subjects were maximizing sub- 
jective expected utility rather than expected 
value. Close examination of the four studies 
that produced evidence against the expected 
value hypothesis reveals that in each of these 
Studies either the utilities could well have been 
different from the explicit values (Bem et al., 
1965; Pruitt & Teger, 1969) or the subjective 
Probabilities could have been different from the 
objective ones (Clausen, See Footnote 14; 
Wallach et al, 1964). The latter experiment was 
reviewed by Clark (1971). Its results are 
also Considered to be the Strongest evidence 
in Support of the diffusion-of-responsibility 
Ypothesis, yet they lend themselves to differ- 
ent interpretations. The 
used in the study consi 
College Board | 
level was me 
determined 


?! Clausen, G. S, Risk takin 
lished doctoral di 


in small groups. Unpub- 
1965, 


University of M Tichigan, 


t can 
difficulty of the item was not mentee site 
be argued that group discussion a Answer 
jects’ confidence about their abili : m to take 
the item correctly, and influences t ie m. Alter- 
what objectively is a more difficult rne ablest 
natively, since the group can selec ould also 
member to tackle a problem, At pec with 
choose a more difficult item (i.e. Footnote 
lower odds of success). Clausen gm her ex 
14) utilized the same URDU. E correla- 
periment and indeed found a posty that 35 
tion between a subject’s riskiness, fidence. 
choice of a difficult item, and his he ma- 
It seems likely, therefore, that hag eriment 
nipulated in Wallach et al.’s (1964) por ence 
was not shared responsibility um Ei 0 
and subjective perception of the di - hypoth 
the items, In conclusion, a rationality SM 
esis that adopts maximization of xi choice 
expected utility as a criterion of rer well 4 
is still a viable explanation for risky ~ ibjectea 
lor conservative shifts and should be st 
to direct test, not 

A study by Stoner (1968), though tility hy- 
rect test of a subjective expected ut nd pro 
pothesis, Suggests a way of doing it d effort: 
vides results that certainly justify a, 8 
In his study, subjects were presented describ” 
Phrases, selected by the experimentet, choice 
ing possible outcomes involved m. tj ne pos” 
dilemma items, Each phrase described en 
sible outcome, Subjects were asked to e im- 
18 phrases “in the order in which they dy is 
portant to you.” (Although in the stu ins 
instrument is called a “value-ranking instr 
ment," it is clearly a ulility-ranking kë 
ment.) In addition, subjects were adii 
make the usual decisions on each item in usua 
ually, and were subjected to the ensus: 
procedure of group discussion to cons ing 2 
The results show that for items producing 
risky shift, the Successful outcome of Lies 
alternative was ranked higher in ME 
than the outcome of the cautious, more “Grems 
alternative, The converse held for the jts o 
that Produced cautious shifts. The resu e the 
this €xperiment may be misleading because > 
value-ranking instrument was nol constr E 
by a. valid Objective procedure and (per ae 
aSa result) did not include phrases for all y 
Sible outcomes, Consequently, the entire € 


$ = Sa on- 
Periment is Open to serious criticism even Ci 


tru 
u- 
to 


SS —-_- 
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Hase EET a 
E S mediate purpose, which was to 
e EUbicctive hypothesis. For a direct test of 
Utilities of ST expected utility hypothesis, the 
Tepresented. outcomes should be adequately 
Scribed py’ and properly dealt with as pre- 
ater that a prediction model. It is argued 
ion fioi great deal of important informa- 
Could be | the nature of the group process 
expected obtained by testing the subjective 
utility hypothesis. 


Th INTERACTIVE HYPOTHESES 
ot a tiw due cognitive hypotheses do 
ts to be the interaction among group mem- 
Only a co Mi eid or sufficient for shift; it is 
Sponsibitite ition under which diffusion of re- 
er a y or information exchange occurs. 
for it, pci conditions may substitute 
Ceive of ri chews interactive hypotheses con- 
Or shifts t interaction as a necessary condition 
vit ident Scour. They are concerned more 
uce the di ing the social processes that pro- 
9f in dare ect (e.g., disproportionate exercise 
he Psycho by some group members) than with 
affective hological mechanisms involved (e.g. 
Stitute S ES cognitive), and as such they con- 
© inter ass in themselves. At the same time, 
Other, mee hypotheses differ from each 
Some € assumes that shifts occur because 
Stoup PES s exert more influence in the 
disagree chon making than others. But they 
Mfluenc as to the reasons for this differential 
m wh e. In other words, their differences lie 
Bug i assumed to be the basis of the in- 
*risticg or instance, some personality charac- 
ence in majority influence, or greater confi- 
Make Wore ft positions. Of course, since they 
‘cal me eee assumptions about psycholog- 
Sarj bin anisms, these theories are not neces- 
nitive h Compatible with the affective or COS- 
n ypotheses. 
5 Lakers as Leaders 
ypo empirical evidence for and against this 
On 4 esis was fully reviewed by Clark (197 1 
p ne theoretical level the leadership hy- 
sonalio assumed the existence of stable per- 
kin y characteristics of leadership and risk- 
Cites” tendency. Most of the evidence indi 
ailur ‘that this assumption is invalid. The 
€ e to find common and stable leadership 
Aracteristics (Gibb, 1969) is as glaring as 
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the repeated failure to find convergence among 
risk-taking measures (regarding the lack of 
stable generalized risk-taking tendency see 
Goodman"; Kogan & Wallach, 1967a; Slovic, 
1962, 1964; Weinstein, 1969). Adding to this 
the negative results of studies that tested the 
hypothesis itself, it appears that the leadership 
hypothesis may safely be discarded. 


Extremity and Influence 

This hypothesis views the greater influence 
exerted by some group members, not as a func- 
tion of some stable personality characteristic 
(such as assumed by the leadership hypothe- 
sis), but as a function of the extreme position 
they take with regard to an issue. The same 


person, according to this hypothesis, may take 


an extreme position on one issue and influence 
then take a 


the moderate group members, but 
moderate position on another issue and himself 
be influenced by the more extreme members of 
the group. Extreme position is not limited to 
the choice of high risk levels (i.e., low probabil- 
ity of success) but includes low risk levels (i.e. 
high probability of success) as well. It is, there- 
fore, a hypothesis that could account for group 
shifts toward greater risk taking as well as for 
shifts toward conservatism. 

The extremity hypothesis was proposed by 
Burns (see Footnote 4) as a model to predict 
group shifts toward greater risk and conserva- 
tism. The amount and direction of shift is de- 
termined in the model by a regression equation. 
Burns argued that subjects respond on the 
probability scale as a Likert-type preference 
scale, so that choice of a very low probability of 
success indicates very intense and strong pref- 
erence for the risky alternative; choice of a 
very high probability of success indicates very 
intense and strong preference for the more 
certain conservative alternative. He also 
argued that as “intensity of attitude varies 
with attitude extremity is well documented 
by past research [p- 38],” it is the group mem- 
bers who endorsed extreme positions (i.e. 
either very high or very low probabilities), 
that are more confident and committed to 
their position- Thus, the greater exercise of 
influence during the group interaction is at- 

15 Goodman, B. Risky decisions by individuals and 


groups- Unpublished doctoral dissertation, Uni : 
of Michigan, 1970. ion, University 
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tributed to the holder of these extreme posi- 
tions. Two similar models were tested using 
choice-dilemma items on 180 group decisions 
and predicted the exact group decisions on 
49% and 53% of the cases, respectively. This 
is far better than any other model that has 
been Proposed, including a convergent aver- 
aging model that uses the mean of the initial 
risk-level choices as the predictor. The latter 
model could predict exactly only 36% of the 
group decisions. It was also shown that greater 
confidence was indeed associated with extreme 
Positions, 
Similar model S on extremity were 
also proposed by Myers, Murdock, and Smith. 
he basis for their prediction stems from the 
attitude change (Osgood, 
hich assumes 
ange of an attitude ob- 
S extremity or polarity. 
ere tested using choice 
£y predicted group de- 


S focusin 


Again, these models wi 
dilemma 


cisions significantly b 


etter than the averaging 
model, The inadequacy of these hypotheses is 
discussed later, 


the interactive hypotheses, 
Group Problem-Solving 
lo-Risk and Tnfluence-th 
potheses 


Analogy: Commitment- 
rough-Commitment Hy- 


"These hypotheses make 
remity hypoth 
exact nature 


the same predictions 
esis. The difference 
of the mechanism pro- 
Posed to account for disproportionate influ- 
ence. They all agree that confidence and com- 
mitment constitute this mechanism, Yet, the 
extremity hypothesis argues that confidence 
is the natural concomitant of taking an extreme 
Position on the scale, since it reflects intensity, 
whereas the hypotheses reviewed in the present 
Section Suggest different causes for the 
Confidence, They are based on an an 
group problem-solving studies 
hoc assumptions. 

Burnstein (1969) pointed out the 
to group de 
sults coming 


greater 
alogy from 
and several ad 


relevance 
sion making involving risk of re- 
from studies of individ 
Myers, D, G. G 
sponsibility diffu » drive enhancement, extremity 
influence, and i nation exchange effects on risky 
shift. Unpublished ma 
Michigan, 1969, 


ual versus 


, Murdoch, P., and Smith, G. F, Re. 
sion 
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P con- 
group problem solving. These SeS A 
cerned with whether group decisions H cisio 
correci than the average individual eriority: 
and if so, what accounts for this 3b) and by 
Early studies by Gurnee (1937a, 1937 rated the 
Thorndike (1938a, 1938b) peace that 
superiority of the group and pinpoi fident in 
“correct”? members were more con that by 
their choices. The inference is aciei 
virtue of their greater confidence el to their 
members were also more committe oughout 
choice and were more influential d 

the group discussion. Burnstein o 
following conjectural explanations: 


is an I 
Perhaps increased commitment or ker which A 
herent consequence of choosing an alterna so would be 
more likely to lead to failure, The paneo and ansk 
first, that the effort, in the form of extensi is greater p 
ous calculation, which precedes a de ap conserve 
the case of a risky choice than in the case o fiect 0 
live choice and, secon ]lowinÉ 
Choice. 
a risk - > correct, f 
either or both of these suppositions are | attrac 
risky decision is likely to take on added 4 
which could wel 


ne- 
wisdom of the; 
tion of traits 


o in 
age 
iskY 
decision makin 


ree 
tne, LIIS 
i ice. "^ 
€ not confident of their et hy 
we will labe] the “influence through comm 


d in re 


These conjectures h sing 
j ) us 


ave been teste 
search by Clausen (see Footnote Ne in- 
College Board problems as a ded 
Strument. In general, her research Be ent-t0- 
Supportive evidence for the commitm' found 
risk hypothesis, Greater confidence Pied diffi 
to be correlated with à choice of a Tap for 
cult (i.e, “riskier”) problem, particular patis 
Problems in the areas of ids aid c a 
relations, and sentence completion, W study 
risky shift in the Wallach et al. ue "nto" 
Was produced, In Problems dealing wit " cot- 
nyms and analogies, confidence was no an 
related with the difficulty of the problenm, a ` 
the risky shift Produced was relatively thesis 
€ influence. th rough-commitment hypo ative 
Was not Supported, because no conser e om- 
Shifts were observed—even when greater the 
mitment and confidence were displayed by 


ae group members. It must be 
these re de ever, that conclusions based on 
lege ad. are limited to studies using Col- 
ment de. tires as a risk-taking instru- 
some. difer ies using choice dilemmas showed 
related erences: Greater confidence was Cor 
P «.001) with extreme choices (p.29, 
tive one. 1 ‘both very risky and very conserva- 
in ease and resulted in risky shifts as well as 

n ceni shifts (Burns, see Footnote 4). 
extent TP the evidence that supports the 
mitment. hypothesis also supports the com- 
iene b and influence-through-com- 
Could 5 hypotheses. There is not much that 

EET e said about evidence differentiating 
hie epou from the extremity hy- 

ce ij l'esting the assumption underlying 
lence rrelation between extremity and confi- 
diii provide this evidence. There are, 
Blained | important results that are not ex- 
^ p by any of the interactive hypotheses. 
rodu results indicate that a risky shift can be 
isten us by individual subjects who merely 
mas q © group discussion of the choice-dilem- 
Nay ome an but do not interact in any 
Thout ogan & Wallach, 1967b; Lamm, 1967). 
tude c3 the shift is not of the usual magni- 
Cant. ie nevertheless substantial and signifi- 
X e Also unaccounted for are the results 
y thee that the group decisions are accepted 
cisions subjects covertly as their private de- 
and T (Lamm, 1967; Wallach & Kogan, 1965) 
able tat this acceptance endures for a consider- 
et dae (Johnston, see Footnote 13; Wallach 
her agas. - Since all the interactive hypothe- 
mara that the exercise of disproportionate 
fuv on is critical, they cannot account for the 
aan produced without interaction, un- 
ced suppose that the influence during in- 
live 25H is informational rather than norma- 
bi we Deutsch & Gerard, 195 ). The basis 
Gus: e last statement is made clear m the dis- 

sion that follows. 


STATISTICAL Hy POTHESES 


" freeording to the statistical models, a risky 
^ as well as a conservative shift is an arti- 
actual phenomenon. It does not reflect a 
Change of the individuals’ risk-level prefer- 
ences but appears as an inevitable outcome of 
an a priori rule of ordering and reconciling the 
individuals? preferences into one decision. Con- 
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sequently, no assumptions concerning the 
exercise of influence or other affective and cog- 
nitive processes are made. The important 
variables that account for group shifts are only 
the group decision rule and the distribution of 
initial risk-level choices. The models differ in 
what they assume about these two variables. 
Two decision-making models considered as 
alternative explanations of the risky shift have 
been reviewed by Burnstein (1969). One model 
is applicable when individuals’ rank orders of 
preferences or judgments are secured and can 
be compared (correlated) with a “true” or cri- 
terion rank order. The other model requires 
that the rank-order data be processed accord- 
ing to a majority-rule decision scheme or, 
alternatively, the processing of the single most 
preferred judgments of the individuals ac- 
cording to the same scheme. In the latter case, 
the mode (or median) judgment is taken as the 
pooled group judgment. These models are 
based on a test theory model (Eysenck, 1939; 
Preston, 1938; Zajonc, 1966). In brief, the 
basic model suggests that if there is an initial 
tendency toward accuracy—better than 50% 
chance of being correct, or à positive corre- 
lation between judgment and criterion—then 
the correlation between pooled group judgment 
(average rank order across subjects) and the 
criterion will be greater than the average in- 
dividual correlation. In other words, pooled 
group judgments will be better than the aver- 
age individual judgments only if (a) the major- 
ity of the individual judgments tend to be cor- 
rect (ie. exhibit some positive correlation 
with the criterion) ; or (b) the majority of judg- 
ments are correct and the mode judgment is 
taken as the pooled group judgment. If riski- 
ness (or conservatism) is substituted for cor- 
rectness, then shift toward risk (and conserva- 
tism) follows directly from the model. When 
the average individual choice tends to be risky, 
a risky shift is predicted, and when it tends to 
be conservative, à conservative shift is pre- 
dicted. Calculations were attempted on con- 
trived data and yielded results in accordance 
with the model (see Burnstein, 1969; Zajonc 
1966). Luce and Raiffa (1957, pp. 358-359) 
have suggested another procedure for aggre- 
gating individual choices to predict erate did 
cisions according to the majority-rule model. 
This procedure was attempted on the contrived 
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data and again yielded results in support of the 
model (Burnstein, 1969). 


What could the results of studies producing 


Shifts tell us about the majority-rule mode] as 
a valid explanation for group shifts? For one 
thing, to validate the model they would have to 
Show that on items that produced a risky shift, 
the average individual tended to be risky (i.e., 
the majority of individuals made risky choices), 
and on items producing a Conservative shift, 
the average individual tended to be conserva- 
tive. In studies using College Board problems 
where a risky shift was found, the average of 
the initial Probabilities chosen was quite low 
(.30), indicating Strong initial tendency for 
risk (Clausen, see Footnote 14; Wallach et al, 


that used the choice- 

are not quite consist- 
ent with the model. According to data of six 
i Y itt and Teger (s 
Verage initial 
10 items that produced a r 
risky 
ìs incongruent wi th 


isky shift is 54; for 
items, it is only .49, which 


75, Congruent w; 
Congruent results for the model can be fo 
Stoners (1968) study us 
choice-dilemma i 
Probability for six tisky item 
our Conservative items, the 
The results of the studies 
asks involving real consequences 
ent with the majority-rule deci 
model. The Same is true for 
Stoners (1968) study using 
choice-dile a items, 
more Cor 
tionnaire 
the model, "t 


average is .76, 
that used decision 
are congru- 
sion-making 
the results of 
a revised set of 
Nevertheless, the re- 

nmonly used choice-di- 
emmas ques are incongruent with 
ce with regard to 
" as an explanation 
clusive, Additional 
determine w 
‘an be Considered 
for the risky shift, 


hether the 
& valid explanation 


- validity 
If the majority-rule model has any h "m 
in explaining Shifts, positively SE ma- 
butions of initial choices (ie., with a ri atively 
jority) should yield a risky sh ift, — cau- 
skewed distributions (i.e, with a ra ervative 
tious majority) should yield a poe distri- 
Shift, regardless of the means of n a simple 
butions. This could also be viewed as z manip- 
conformity situation. Vinokur (1969) ix mos 
ulated initial distributions using the "uestion- 
risky items of the See diemma ood 
naire. Though the positively nd (—.865 
tions yielded a significant risky shi "ibutions 
b «.001), the negatively skewed distr insteà 
did not yield a conservative shift, pls Fu 
yielded a Small nonsignificant ma * nega- 
(—.33), even though the mean: of 18 on the 
lively skewed distributions was .52 (Es adf 
conservative Side of the scale). m": from 
it was Shown using reanalyzed raw ne risky 
three other studies that had yielded naire 
shifts on the choice-dilemmas pgp & 
(Wallach & Kogan, 1965; Wallach, os con- 
Burt, 1965; Wallach et al., 1962) the eats 
trary to what would be expected, the P ually 
age of positively skewed groups was vie y 
the same as the percentage of npea 
skewed groups (42.2% and 43.6%, aida 
tively; the rest were Symmetrical). Bos rther 
analvsis of the Same raw data produced | Both 
evidence against the majority-rule E “tribu- 
the Positively and negatively skewed £ p the 
tions on the three most risky 343 with 
choice-dilemmas questionnaire (4, 6, E risky 
nearly the same means (.42, .45) yielde 1.58, 
Shifts with nearly the same magnitude e (2, 
—1.41). It was only on two nonrisky i keine 
5) that the positively and negatively sk set 
distributions yielded diametrically OP ote 
shifts (—.50; $ <.05; 53, p <.01) as pre Seil 
by the majority-rule model. Zajonc 3 an 
(1968) examined data produced in their 2 Peck 
ment comparing individual and iie vais 
taking in a two-choice situation in r1 hese 
ous group decision models. None o 


2l 
1 lortvrule model: 
models, including the majority-rule mede 
could account for the group shifts 


Conservatism produced in their mcns 

In view of all the evidence, the majority es 
model is untenable. Hence, group hi Pus 
not be considered as statistical Ld em Be- 
Way individual preferences RT ps a WOW 
Sides, as was already shown; 
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bo : 
o evidence indicating that the change in 
he divid cheval preferences is genuine. 
ences follo oals change their risk-level prefer- 
asked to n s a discussion, without being 
ogan ince e any group decision (Wallach & 
Stroup dise 5) or even after listening to real 
ogan peur of the choice-dilemma items 
When is Wallach, 1967b; Lamm, 1967). 
cept the Y do reach group decisions, they ac- 
eve] mé oa decisions as their private risk- 
Maintain eas (Wallach & Kogan, 1965) and 
Cast 6 w eir new risk-level preferences for at 
Wallach » eeks (Johnston, see Footnote 13; 
models et al., 1962). In general, the statistical 
o ko on inadequate because they attempt 
that can ases eei about group processes 

àn be shown to be critical for a shift. 


Coxcrusioxs AND DISCUSSION 


po review suggests that of all the explana- 
ism i Shifts toward greater risk or conserva- 
in s ay three are viable. These are the value 
gs mation relevant to the task) hypothesis, 

Su TA ced hypothesis, and the rationality 
irst Miei expected utility) hypothesis. The 
ence 3 hypotheses are supported by direct ev I: 
ftom and are not inconsistent with evidence 
Sub; other studies. The third one, rationality 
e Jeotiv expected utility), is consistent with 

results of most studies reviewed here, but 


a 5 
S not yet been subjected to direct test. 
s, rationality 


1e two cognitive hypothese i 
Subjective expected utility) and value (in- 
o relevant to the task) could comple 
ue. h other. Both argue that the M 
Stour responsible for individual as we m 
in Mia shifts toward greater risk Le rm ) 5 
ee ae relevant to the task that is prod uce 

tonii the group. discussion. While the ra- 

orm; Ae hypothesis explains how the same E 
Val ation is processed and acted upon, the 
; ^ue hypothesis focuses on how and why new 

ye conn is produced. 

,,418 extremity hy othesis is C 
iile with the wm two hypotheses. It 
Buen in what it considers to constitute the 
or 5» factor, the key mechanism, responsible 
ity h S shifts. Whereas the value and rational- 
m proa A consider the information rele- 
E. to the issue to be the crucial factor, the 
& remity hypothesis considers the greater pref- 
rence for a certain course of action, as Te- 
flected by the extreme position, cie. 


learly incom- 
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crucial one. It argues that the greater prefer- 
ence results in greater confidence and commit- 
the extreme position and leads to 
nce. To put it differently, the two 
otheses can be contrasted with 
n respect to the kind 
he shifts. The value 
assume some kind 


ment to 
greater influe: 
cognitive hyp 
the extremity hypothesis i 
of influence that induces t 
and rationality hypotheses 
of informational influence; the extremity hy- 
pothesis assumes normative influence (see 
Deutsch & Gerard, 1955). Most of the research 
es it is informational influence 
in risk-level preferences. 
d that shifts occur after 
discussion without group decision (Clark & 
Willems, 1969; Kogan & Wallach, 1967b; 
Lamm, 1967; Wallach & Kogan, 1965), as 
well as by mere exposure (listening) to group 
discussions (Kogan & Wallach, 1967b; Lamm, 
1967), and that the acceptance of the new risk 
level endorsed by the group is covert as well 
as overt (Lamm, 1967; Wallach & Kogan, 
1965) and relatively permanent (Johnston, see 
Footnote 13; Wallach et al., 1962). Shifts on 
nonrisky (neutral) choice dilemmas were shown 
to be the result of majority influence—a risky 
shift resulted from a risky majority, and a con- 
servative shift resulted from a conservative 
majority. However, the risky shifts on the 
risky choice-dilemma items were shown to be 
unaffected by the existence of either a risky or 
a conservative majority ; consequently, thev 
can only be interpreted as the outcome of in- 
formational influence (Vinokur, 1969). 
However, the extremity hypothesis—in fact, 
all the interactive hypotheses—demonstrated 
correlations between greater confidence and ex- 
treme position, and these results were mar- 
shaled as supportive evidence. It seems that 
these results can be viewed as equally compat- 
ible with the value and rationality hypotheses. 
We only have to assume that the more in- 
formation and persuasive arguments favoring 
alternative a subject may have, the 
more extreme his position (i.e., risk-level pref- 
erence) will be. In other words, the extreme 
position taken by a subject already reflects the 
level and quality of the information he holds; 
the better that information, the more con- 
fident he will be in holding his position. It is 
analogous to the common finding in group 
problem-solving studies mentioned earlier, 
that the correct group member is also more cin 


evidence indicat 
that induces changes 
We have already note 


a certain 
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fident. Thus, the greater confidence is not 
Something inherent in holding an extreme posi- 
tion as the extremity hypothesis holds. Both 
Confidence and extreme Position are more often 
the result of the availability of better informa- 
tion and more Persuasive arguments, The Cor- 
relation between them may indeed drop to zero 


In ambiguous situations where good informa. 
tion is difficult to i 


dealing with 
Virtually no shi 


Correctness 
al problem 
of one’s choice of an 
gy. 

ctive hypothesis, the 
nnot account for shifts 
mere exposure to group 


opriateness 
ym or certain analo: 
cause it is an intera, 
extremity hypothesis ca. 
that are produced by 


Shifts 


nearly as 
in intera, 


ction groups 
j m, 1967). Tt 
would seem that neithe iti 


Constitute à 
for the appearance of the 


2 Therefore, it 


- The one we 
© related models: 
d information model and 
€ group Problem-solving model. The first 
model was originally 


ount for th 
group size and 


Solomon, 1955; 
More general stud- 
les comparing individua] and group perform- 


& 
ances (Barnlund, 1959; Hall, v for 
Blake, 1963; Thorndike, lala a o 
comprehensive reviews see Collins & rae, F0% 
1964; Kelly & Thibaut, 1969; Lo dies dem- 
Davitz, & Brenner 1958). These stu over the 
onstrated the superiority of ae onm par 
average individual on a variety n " allowe?¢- 
ticularly when full interaction wa there €x 

For any choice-dilemma problem for each 
ists a pool of persuasive arguments ive argu" 
alternative course of action, A parani io in 
ment can be equated with a cap it has 
à group problem-solving situation. -cepted y 
been brought up, it is likely to be gqec9 os 
the subjects as valid. The pool of P^lilemma 
arguments related to a given choice- a large 
item can reliably be obtained ee con- 
population of subjects; thus, predic ‘could be 
cerning individual and group shifts subjec 
made in advance, It is assumed that vi to the 
reading a choice-dilemma item atten legs 
information contained in the item, ma 


ome 
«on In 
: information 
inferences, and organizes the informa 


the 

‘ risky and ". 

arguments for and against the risky a a deci- 

H H r reac. K e 

cautious alternative, Then, he reache the 
sion concerni 


n 

ng his preferred risk level S 
basis of these arguments, If he comes p par 
many persuasive arguments favoring = iious 
ticular alternative (i.e., the risky or the ca sition 
one), he is likely to take an extreme P level 
(i.e., either Very high or very low DES en 5 
depending on which alternative the arg itt 
favor), and to be very confident and com! and 
to his decision, His extreme pouces. the 
&reater confidence are both Consequences esses- 
quality and number of arguments he poss e 
Furthermore, he may think his risk werk. 
somewhat more extreme (i.e., either more mos 
or more conservative) than that taken by e 
others (evidence for this was presented ae 
because he feels that he possesses argum 
that others may not be aware of. t. prê 

If for a given choice dilemma there is ba P 
Ponderance of Persuasive arguments for pifts 
alternative, then individual and group Fete 
in risk level are Predicted to occur in the ce 
tion of the alternative favored by the east 
ments, The Strength of the shifts is age 
determined by the Probability that an aver we 
Subject will Possess all or most of the pers 


H 1 i er 
Sive arguments, If this probability E e 
Very high or Very low, strong shifts will no 
Cur. If the 


probability is high, the individuals 


BEREA ba the 
ave already made their initial decision on th 


f 
i 
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basi 
to Beine persuasive arguments. Thus, prior 
extreme, yin group members are already 
Unlikely iad} probability is very low, it is 
hose persus individuals will come UP with 
Siderable D arguments, even after con- 
ikely mag Therefore, it is also un- 
Ossess th : group will contain individuals who 
em in dis persuasive arguments and use 
Strongest E For these reasons, the 
With a pre ifts are predicted on choice items 
for ag of persuasive arguments 
bility Ft: but with a moderate prob- 
em, Tf = an average individual will possess 
Conservati poup shift toward greater risk (or 
is Lour following persuasive arguments 
bette; uted for group movement toward 
Problem solution, then the analogy be- 


We i 
Solvin the risky shift and the grouP problem- 
Gi € model becomes apparent. 
derance of 


ge an item with a preponce 
Stren € arguments to one direction, the 
on eth of the shift is also determined by the 

p decision. Very 


seditio 
: H 
a all sh when the subject 


Shifts are predicted 
dime [erc He is likely 
argun - f the persuasive 
ion que during the process © : 
Derim hese shifts may be difficult to detect ex- 
repli entally, which may explain the failures to 
py. Cate Bateson's (1966) and Flanders and 

istlethwaite's (1967) results. Moderate shifts 


n ; hifts 
me, Predicted for a situation where subjects 
ley listen to group discussion. Here the 
s will be 


me that persuasive arguments v 
3l Ma ae up during the group discussion iig 
Who di ly increased. This will influence listeners 
their id not possess these arguments to change 
dicteg cision, The strongest shifts are py 
Enga, for asituation where thesubjectsactut y 
extrem, in group discussion, because the ess 
Beane members can ask the more extreme 
Poi bers direct questions and clarify moot 

nts, 

f this explanation is valid, there should be 


No di u 
difference in the magnitude of the shift be- 


a i 
cen groups forced to reach a unanimous 
hich individ- 


a decision and groups in W din 
BIS are asked to reach private decisions 
shifts ve discussion. Indeed, equally strong 

S were found in the two conditions by 
xd and Kogan (1965) and Lamm (1967). 
dict ermore, the strength of the shifts is pre- 

ed to increase with group size. The more 
Members in the group, the more likely it is 
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that the group will contain at least one person 
with the persuasive arguments; consequently, 
the more likely it is that the persuasive argu- 
ments will be brought up during discussion. A 
study by Teger and Pruitt (1967) demon- 
strated a considerable increase in the magni- 
tude of the risky shift with increases in group 
size from 3 members (shift of —.14) to 4 
(—.59) to 5 (—.97) for the 10 risky items of 
the choice-dilemmas questionnaire; and, like- 
wise, an increase in conservative shift in groups 
of Size 3 (—.05), 4 (A2), 5 (.38) for the two 
conservative items of the choice-dilemmas 
questionnaire. Equivalent results are reported 
by Vidmar and Burdeny,” using a different 
set of choice items, for an increase of both 
the risky shift on riskv items and conserva- 
tive shift on conservative items with groups 
of Size 2, 3, 4 and 5 members. Similarly, 
the strength of the shift is predicted to increase 
with the heterogeneity (variance) of group 
members’ initial choices. Again, a study by 
Vidmar (1970) showed significantly greater 
risky shifts in heterogenously composed groups 
than in homogenously composed groups. 

So far, nothing has been assumed about the 
nature of the persuasive arguments. It remains 
to be seen whether the persuasive arguments 
are those appealing to cultural values of riski- 
ness and caution in given situations, as Brown's 
value hypothesis implies, or whether they are 
merely informational arguments bearing ‘more 
specifically upon the desirabilities (utilities) of 
the various possible outcomes in each situation. 

Thus, the present review is in basic agree- 
ment with the conclusions reached by Clark 
(1971) concerning the necessary and sufficient 
conditions for the occurrence of the risky- 
shift phenomenon. The explanation based on 
partially shared information and group prob- 
lem-solving models introduced above provides 
further clarification of those conditions, be- 
cause it accounts for their structural relation- 
ships. According to this explanation, persuasive 
ts raised in the discussion (Clark’s 
Point a—‘“risk relevant content”) are the pri- 
mary cause. Clark’s other conditions—(b) per- 
sons who perceive themselves to be relatively 
risky (or conservative for conservative items), 


argumen 


5 vidmar, N., and Burdeny, T. C. Interaction effects 
of group size and relative risk position with item type 
in the “group shift" effect. (Research Bulletin No 
128) London, Canada: University of Western AES 
Department of Psychology, 1969. $ 
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(c) heterogeneity of initial opinions, and m 
severity of consequences of failure—are merely 
related to Point a, being the conditions under 
which Point a is operative. All these conditions 
appear jointly when the shift phenomenon 
occurs, and it is drastically reduced when even 
one is eliminated. The “risk-oriented instruc- 
tions" specified by Clark (i.e., his Point €) are 
indeed necessary for the risky shift to occur, 
but the explanation of this in terms of “de- 
mand characteristics” jg questionable, as is 
later discussed, : 

It has recently been argued that individual 


rd greater risk or con- 
ent-bound exceptions to 
y of the group, but rep. 
phenomenon: the polar- 
normative commitment 
ni, 1969). Tt was demon- 


general model that de. 

effect of Partially shared information on group 
Productivity and does not view these shifts as 
Content-bound exceptions, However, contrary 
to Moscovici and Zavalloni’s view that these 
shifts are due mainly to normative or, in our 
terms, affective Processes, the expla 
Sented here sug, 


ggests that shi 
Cognitive or information 


THEORETICAL EVALUATION AN. 
FOR RESEARCH 

The conceptual p r most of the hy- 
Potheses reviewed are drawn from the general 
mics, with occasional re- 


DI MPLICATIONS 


‘yond the rejection of 
(expected value) hypothesis, 
that both behavioral decision 
ries of individual and group 
Tocesses would benefit from 
the application of behavioral decision theory to 
complex situations (e.g., choice dilemmas), 

Moreover, the es con- 
cerning the ? of risk (Coombs & 
Huang, see Footnote 9), as well as the lack of 
convergent validity in { € assessment of risk 
taking. (Goodman see F 


many is Ootnote 155 Slovic, 
1962, 1964), raise Serious doubts about whether 
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Se ur 
studying group risk taking will ems 
understanding of group praem e fruit- 
focusing on risk taking, it might be he tenon 
ful if future studies concentrate on the 

rocess itself, . in- 
i When seeking an understanding of € 
dividual choices are processed and pee of 
into some kind of group decision, a wa 
these individual choices is essential. oe in 
it is helpful to know how they are poem this 
the first place. Zajoncthas suggeste roup task 
be a requirement for any standard p helpful 
(Zajonc, 1965). Obviously, it would akin 
to start with a good theory of decision r aie A 
Behavioral decision theory vendi d ^ any 
theory. It spells out relevant variab es ange 
decision-making situation wer der 
tainty, and suggests various relati in be- 
among them. The relevant pere into 
havioral decision theory can be classi jective 
the following three categories: (a) a Tents 
and subjective probabilities that certain salues 
will occur and their outcomes follow; (b) aE 
and utilities (i.e., subjective values) E o 
to those outcomes; (c) decision e en 
combining Probabilities and values and u hoice 
lo arrive at a decision with respect to à Si ac- 
of one from several alternative courses cision 
tion. The formulations of behavioral e ap- 
theory are rigorous, and this has made ately, 
pealing to experimentalists, Untoria s 
With very few exceptions (e.g., sace t tests 
Lavin, & Goodman, 1962) apeme ange 
of the theory have been limited to a ag has 
of experimental tasks and situations. same 
usually consisteq of outcomes with meri 
single value dimension—usually money ama 
Subjects Working for either real ne podiar 
monetary payoffs or imaginary ones (cf. ations 
& McClintock, 1967). More complex situa po- 
are those where the values of the vann oe 
tential outcomes belong to different dimen: 


i gcaora 
(e.g. money, prestige, power, health). A 
Ing to behaviora] 
make a di 


Utilities of the 


ide 

e choice-dilemmas task seems to provi 

mplex Situation for testing pane 

theory, In turn, behavioral Ris na- 

y be helpful in discovering bs cess 
ture of the group decision-making pro 


m 


ee < 
QE” M 
FP s. 
* Oc— ÉD 
—————————————————ÁÁD ' 

a 
2 : ——— ——————— NT MÀ 


Which i 
pus the ultimate purpose of most of the 
Cessfull reviewed here. How could this be suc- 
Betermia en cene First, it should be 
to Qc to what extent subjects attempt 
terms of c expected utility. In 
à subject S ni shiit paradigm, it means that 
"3 "rbi preference must reflect 
Utilities ass ion of the differences among the 
in the panes with the potential outcomes 
expected one alona situation. A subjective 
iens ee model for predicting risk-level 
can be im or a typical choice-dilemma 1tem 
Let fj “ee as follows: ——. 
Outcomes ny U; designate the utilities of the 
y af xn the certain (cautious) alternative 
( gem 1e risky alternative when successful 
f "is : wien it fails (Us); and f, q designate 
ana on probabilities that the risky al- 
ul (g) bes be successful (5) or unsuccess- 
Second, p rapa, first, U, > Ue > Us; and, 
bj Bah a= 1 Gee, q7 17 P for every 
n 
SEU = È piUs 1] 
NNNM ist 
Hebe is the subjective expected utility. 
mative. expected utility for the risky al- 
e: 
[2] 


SEU; = pU, + (1 — DU» 
f the certain 
[3] 


pected utility 
he choice of 
only when 


and 
r Subjecti a; 
al ctive expecte . 

lternative: e expected utility o 


SEU, = 1: Uc 


f Maxim: 

oviga nization of subjective ex 
th tis the decision rule, then t 
Its guy XY alternative should follow 
Biene ative expected utility is equal to or 
[UT than the subjective expected utility 


ne er i e H H 
When Certain cautious alternative: that 1s, 
EJ 


BU, 4- (1 — p)Ur 2 1: Ue [5] 


tive should be 


Or SEU; 2 SEUc, 


$ 


Ong 
4. Se 
Chog, ently, the risky alterna 
if and only if 
U, — U; 
1 mM—— [6] 
A ot U, — Us 
Ce, er "T 
ps at Words, the owes! probability of suc- 
ae 1 & subject is predicted to consider ac- 
der Oa (ie, his risk-level preference in 
ttempt the risky alternative if he uses 
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maximization of subjective expected utility as 
a decision rule is: EY 
bia Ue— U; 
U, — U; 1] 


(p : the ratio of the utilities of the outcomes, 
as judged by the subject, which also serves as n 
predictor of the subject's risk-level preference, 

3 


p. 
. Several important consequences of the sub- 
jective expected utility model are indicated by 
Equation 6. First note that the p that maxi- 
mizes subjective expected utility is not deter- 
mined by the absolute utility of any single out- 
come alone, but by the relative differences 
between all the utilities. Therefore, Clark's 
(1971) conclusion that the risky shift depends 
on the utility of failure, Uy (i.e., “severity of 
the consequences of failure") is certainly com- 
patible with the model. However, if the model 
proves to be valid, the conclusion will have to 
be extended to include the utilities of the 
other outcomes (i.e., U. and U,) as well. 
Second, in order to reach a group decision 
lowest p, it is not necessary for 
the subjects to reconcile their absolute utilities 
as is often assumed, but merely to 
ative magnitudes, that is, on 


concerning the 


directly, 
agree on their rel 
the ratio among those utilities. 
Third, for the occurrence of a shift toward 
risk, *risk-oriented instructions" (ie. the 
standard instructions requesting subjects to 
choose the lowest probability that is acceptable) 
are necessary. Without the word “Jowest” the 
meaning of the task becomes vague, since in- 
variably any higher probability is preferred to 
a lower one—a point that may be raised in a 
group discussion inhibiting the shift toward 
instructions that omit the 


risk. When “neutral” 
word “lowest” are presented to subjects, the 
model would predict that the usual significant 
risky shift will not occur. Indeed this was 
found by Clark (1971). The model would also 
redict that “neutral instructions” will not 
reduce the conservative shift on conservative 
choice dilemmas and would possibly increase 
it—a matter for future research. Thus the 
necessity of risk-oriented instructions to pro- 
duce the risky shift is shown by the subjective 
expected utility model (see Equation 6) and 
confirmed by Clark and Willems’ (1969) data. 
However, his interp 


retation that these instruc- 
tions impose “demand characteristics’ 


’ and that 
the risky shift is the outcome of "demand 
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characteristics" is questionable. From an em- 
pirical point of view, this interpretation does 
not account for conservative shifts that are 
produced with the same risk-oriented instruc- 
tions. From a theoretical point of view, these in- 
Structions are necessary to?produce a risky 
shift (but not a conservative shift) as is shown 
by Equation 6 of the subjective expected 
utility model. 

If the subjective expected ut 
Proves to be valid for individu 
havior—in other words, 
choices do reflect the relati 
ties required by subjecti 
theory—then it can be use 
nature of the group pro 
risky and conservative 


observing how the Stoup process affects the 
relationship among utilities the nature of the 
group process could be determined.) In order 
to validate the je 


» positive correlations between p and $ 


(i.e, B as in Equation 7) should be obtained 
for each choice dilemma across Subjects, as 
Well as within each subject across several 
choice dilemmas, 

efore proceeding with the anal 


Soup process, let us define some new terms, 
Let pi p 


» P2, Pr, and f, designate the risk-level 
choice before (bi) and after (p2) group discus- 
ton, and the predicted Probabilities based on 
he utilities’ ratio also before (51) and after 
2) group discussion, 
Given data regarding py, bz, Pi, and Ê», it 
st whether a cognitive or an 
is responsibl 
a cognitive 


ility model 
al choice be- 
if subjects initial 
onships among utili- 
ve expected utility 
d to clarify the exact 
cess responsible for 
shifts. (That is, by 


ysis of the 


Process underlies 
e ily be determined 
alysis. More Specifically, it will 
the effects of two viable ey- 

and conservative shifts 

d. One explanation based 
informatio 

Solving models conte 


planationg for 
could be 


5 ; Prod Y the introduction of new 
information ( » persuasive arguments) dur- 
puo lan. This explanation is 
c € informational influence hy- 
pothesis, In contrast, the other ine 
Suggests that the shifts toward risk and con- 
Servatism are not due to the influence of new 
information but to a Process of rationalization 
in which better use is made of the same inf : 
mation according to the subjective pear 


Ae aie b- 
utility model (i.e., better maximization P. is 
jective expected utility). This rd ex- 
referred to as the rationality subjec 
ected utility hypothesis, ; y- 
: According to the informational We dd 
pothesis, there has to be a change n 4 of the 
direction of the change in f nume tilities 
additional information relevant to the d not 
on which f is based. However, ae p 
be any increase in the goodness of hb pes (i.e., 
and f subsequent to the group m p nee 
the absolute difference between p Bm em 
not decrease), as increased MEM this 
group discussion is not assumed b) 
hypothesis. : . esis, 
poca to the rationality hy geni 
there should nol be a change in the vi new 
quent to the group discussion, since added. 
information on the utilities has been iiti 
The shifts in p are in the direction att isk-leve 
Ê because of Subjects changing their ris a orde: 
choices to conform to their utility ratio » on- 
to maximize subjective expected utility ? ah 
sequently, the goodness of fit between roup 
P should increase considerably after E roti 
discussion (i e. the absolute difference be 
P and Ê should decrease), ational- 
Tf both informational influence and ve | be 
ization Processes are operative, there K the 
both a change in Ê in the qoe ess 0 
change in p and an increase in the goodn 
fit between the two, ality 
There remains, of course, the pos eses Í 
neither of these two cognitive hypo tiv 
responsible for the risky and conse at 
shifts. In this case there should not be a € ood- 
in Ê after group discussion; instead, the P ase. 
Ness of fit between p and f should P an 
Evidence of this kind would suggest shifts 
affective process is responsible for the 2 or for 
that new utilities [e.g., utility for pr 
appearing risky, as the risk-as-a-valu t 
formation about others’ choices) Len 
a Bests] not represented in the ar qut in- 
the choice dilemma (i.e, Equation 4 though 
uencing the tisk-level choices. (Or, al rs ate 
unlikely, it may be that group mem Cice 
Moving away from subjective expectec ie 
and applying a different decision rue 


e 
Summary of thi. analysis is represent 
Table 1; 


Prelimin, 
and other 
for Group 


that 


ga 
hesi* 


utho" 
ente! 
cial 


H r a 
ary results of studies by xa 
researchers at the Resear e R 
Dynamics in the Institute fo 
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TABLE 1 


SUBJECTIVE EXPECTED UTILITY ANALYSIS OF THE EFFECTS OF GROUP PROCESSES ON 
INDIVIDUAL AND Group DECISIONS INVOLVING Risk 


| 
e Analysis and results 
hoice-dilemma i 
tee a bred =F : Evidence in support of: 
TREE au ||) Riesen a ond? 
Risky n 5 r P 
ao : pi bio In-Al-l pi bil Informational influence hypothesis 
nservative pi—pi>0 | pi— pil =| P2— Pel 
Risky z - iS EU 
OR i phi Ih» Hes P: Rationality (subjective expected utility) 
nservative bi pio0 In-hl» | pi— pel hypothesis 
Risky = n e , 
CE ps i0 In- hil | p2—ps|* | Informational influence and rationality 
nservative pbi pi>0 | pi—Pil > | p»— psl* (subjective expected utility) hypotheses 
d hi— <0 In—hl«l ps—ps| | Affective hypothesis (or change in de- 
Onservative ps— p>0 | pi—Pal <l: Pal cision rule) 
m E 
prote that the increase in goodness of fit can result from either changes in por f, or both, According to the rationality hypothesis, 
eque, Purports to account for the shifts in p, the goodness of fit should increase as à result of the changes in 5 rather than in f. Con- 
ntly, the results that satisfy the above mentioned equations constitute evidence for the rationality hypothesis only if the mean 


change in 4 ; 
ge in $ is greater than the mean change in P (ic^ 


R arı H n 
iim Ch offer strong support for the subjective 
des utility model of individual choice 
"u avior for the choice-dilemma problems. 
on hermore, these results indicate that 1n- 
one influence rather than rationaliza- 
hs the process responsible for group shifts. 
Vari ever, more research, employing a greater 
ety of tasks, is needed before the effects 
de Ep processes on individual and group 
i : s L : à : 
Stoog. involving risk will be fully under 
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GROUP-INDUCED SHIFT TOWARD RISK: 
A CRITICAL APPRAISAL * 
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Florida State University 


The present study € 
the prominent theories 
each. These theories are t 
pothesis, the diffusion-of-responsi 
pothesis, The present evidence sugge: 
is dependent upon (4 
selves to be relatively risky; (c) 
for which the consequences O 
instructions. The risk-as-value 
necessary conditions. However, 
and the relationship between 
future research will have to 
necessary conditions oF 
account for the appare 
others in a culturally va 
norms that conflict with 
raises troublesome and com) 


risk an 


p arrangements 
king decisions 
tendencies to- 
] daring. One 
this assump- 
is in decisions 
r dimension of 


Dus 
atten assumed that grou 
Ppress ng problems and ma 
ward 4, or dampen individual 
Set o oldness, innovation, anc 
tig n circumstances for which 
c "i longer appears to hold 
k. ear that embody a clear cime 
Seg LOWE a host of studies, initiated by 
allac in 1961 and given great impetus by 
on ch and Kogan, has demonstrated that, 
depres average, individuals advocate greater 
Ina S of risk taking following participation 
in ch than they had previously advocated 
Eu, e (Bem, Wallach, & Kogan, 1965; 
; 1968; Kogan & Wallach, 1967a; Pruitt 
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4". 


ritically examines t] 
and the empirical 
he familiariza 
bility hypothesis, 
sts that the occurrence 


the formulatioi 
nt finding that in 
lued direction, su 
the initially elicit. 
plicated questions 


he risky-shiít phenomenon by reviewing 


) risk-relevant content, 
heterogeneity of initial o 
failure are not severe, 

hypothesis seems to accou 

just how the cul 

d instructi 


determine W. 


n 


] evidence confirming and disconfirming 


the leadership hy- 
e risk-as-value hy- 
of the phenomenon 
(b) persons who perceive them- 
pinions, (d) situations 
and (e) risk-oriented 
nt for most of these 
tural value of risk is mediated 
ons is less clear. In addition, 
hether modification of these five 
oi additional conditions is needed to 
dividuals, who initially deviate from 
bsequently conform to manipulated 
ed value. The very recent research 
in this regard. 


tion hypothesis, 
and th 


& Teger, 1969; Stoner, 1968; see Footnote 3; 
Teger & Pruitt, 1967; Wallach & Kogan, 
1965; Wallach, Kogan, & Bem, 1962, 1964). 
The phenomenon occurs when subjects in ex- 
periments are asked to serve as advisers to 
hypothetical persons who are described on 
paper as being confronted by a choice di- 
lemma (Kogan & Wallach, 1964; Stoner, see 
Footnote 3); the hypothetical persons must 
choose between an alternative that is less at- 
tractive but safe and a situation that is more 
attractive but less likely to succeed. An ex- 
ample is an electrical engineer who may re- 
main at his current job at a modest but ade- 
quate salary or take a new job that offers a 
much higher potential income but no assur- 
ance of long-range security. Subjects in the 
typical experiment. make their recommenda- 
tions regarding the engineer's accepting the 
new job, on a scale of odds varying from 1 
in 10 through 10 in 10 that the new job will 
succeed. They choose their recommendation 
in terms of the minimum odds of success they 
would find acceptable before recommending 
that the engineer take the new job. After 
making their recommendations for a whole 
set of such items, subjects meet in groups and 
discuss each item in open-ended discussion. 
After the discussion they make new individual 
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recommendations for each item. The typical 
finding is that on the average, subjects shift 
toward greater risk; that is, they recommend 
accepting alternatives at lower odds of Success 
after the group discussion than they did be- 
fore. This is the prototype of the group-in- 
duced shift toward risk or, simply, the risky 
shift. 

This group-induced shift towa 


been obtained with a wide ran 
content, 


rd risk has 


ts (Stoner, see Footnote 3), 


English studen ; Israeli stu- 


dents (Rim, 1 
(Kogan & W. 


a), groups who sim- 
(Clark & Willems, 
1967), groups who 
Tcom system ( Kogan 
and groups who merely 
observe other groups interacting (Lamm, 


Two important thin 


the risky.shift phenor 


gs stand out regarding 
its d 


menon: the recency of 
Iscovery and the fact that none of the 
most common social psychological theories 
explains it adequately. Soci 


al psychologists, 
have long been interested in the differences 


between decisions arrived at through some 
group process and those that are the Product 
of individuals’ efforts (Jones & Gerard, 1967; 
Kelley & "Thibaut, 1969). However, the pre- 
vailing views are either that group decisions 
will be more conservative than individual de. 
cisions or that group decisions will represent 
an averaging, or simple convergence, of in. 
dividual decisions, Many experimental results 
can be cited to Support each view. 

An experiment by Schachter (1951), as well 
as others cited in Cartwright and Zander 
(1960, pp. 165-341), found that group mem- 
bers tend to exert the most influence attempts 


iate at 
toward those group members who Nos 
the extremes; as the opinion of tem the 
member is perceived to move e of conr 
group's central tendency, the amoun eases: 
munication directed toward him deci s exer 
In support of the view that gro dividual 
Conservative influences on the in with in- 
Barnlund (1959), comparing am ani from 
dividuals on drawing logical eon mem- 
prepared arguments, found that gn rate. 
bers were more cautious and delibera con- 
concluded: “the necessity of rapper self- 
clusion forced many students to he — wa 
critical [p. 58]; and, “group discus king, 
found to stimulate more careful aes 
lead to a consideration of a wider t€ d 
ideas, and to provoke more abe? anh 
Critical testing of conclusions [pp- on grouP 
Schein (1965), in reviewing og group 
discussion, made a similar point. identit” 
is more likely than the individual i taken 
errors of judgment before action i unctio? 
Greater care and deliberation as a fü 
of group influence is also imet 
finding by Zander anq Medow (19 lower 
subjects working in groups more often ecte 
their aspirations following an unexr pikib 
poor performance than did those 4j and 
alone. Evidence from Allport ges d hat 
Farnsworth and Behner (1931) SUE jud 
subjects tend to give more nine rou. 
ments of weights and odors when in 4 8 : 
Situation than when alone. ata tha 
While a variety of studies indice nent 
groups make more conservative ju 5 that 
than individuals, it should be emphasing 
none of the above studies offers a SyS and 
basis for distinguishing between group the 
individual risk taking. Few, if any, vith 


d 5 rned 
Studies have been directly concern 
the dimension of risk. n 


cO 
$ : ies of ©. 
Finally, the findings from studies © 


in 
E jon ^ — 
formity can account for the reduct re 
Stoups of the j 


oh dels the 
nitial variation among 

Sponses of g 

inger, 1954. K 


TOup members (Asch, 1958; 

rech, Crutchfield, & Ballac 

1962; Sherif, 1935, 1936). However. 

Brown (1965) pointed out, conformity © ye 
account 

of the risky- 


tu 

for the most important ter ic 

shift phenome i 
Shift toward risk. While Variability in. 1 
Sions is reduced after discussion, individ" 


that 


est^ 
pry? 
as 


non: the syster d 


al 
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a 5 
aa willing to accept risky decisions after 
| had b Pating in interacting groups than they 

een initially. 

Tre in the recognition that other hy- 
—. formular, than conformity would have to be 
|, Variety ci to account for the risky shift, a 
| Técent " explanations have been proposed in 
four ete to explain the phenomenon. The 
of empiri have generated the greatest amounts 
miliaris, m research may be called the fa- 
Pothesi. lon hypothesis, the leadership hy- 
Pothesic. the diffusion-of-responsibility hy- 
a S, and the risk-as-value hypothesis. 
evidence for each of these hypotheses 15 


di 
Scussed in turn. 


Tue FAMILIARIZATION HYPOTHESIS 


Tt might be argued that the risky shift is 
effect wag group effect, but a pseudogroup 
1964 as defined by Secord and Backman 
). Pseudogroup effects occur 1n groups, 
Cad actually result from group pro- 
Operati hey can be reproduced in individuals 
d independently | under appropriate 
(1965) ac Two studies, one by Bateson 
histlet and the other by Flanders and 
Shift tc (1967), indicate that the risky- 
Eea amenan might be attributed to a 
group phenomenon. 
hoo CR argued that group discussion al- 
te Persons to become more familiar with 
Situations being discussed, and contended 
d this increased familiarity with the situa- 
towa is responsible for the observed shift 
any vd, risk. According to this hypothesis, 
with procedure that will increase familiarity 
5 fte issue involving risk will cause persons 
Re oe more risky on that issue. Informal 
Roe. oe and anecdotal evidence provide 
anim ee for this hypothesis. For many 
in dais in unfamiliar situations or humans 
ee vira (eg. boys at a 
Saye swimming hole) initial periods of 
Usual exploration, and “feeling out” are 
More 7 followed by behaviors that are much 
bold and daring. To test this general 
gs ihesi, Bateson compared the riskiness of 
e ps who discussed five of the choice di- 
E with that o individuals who merely 
Doi $ briefs oF ee items indicating the 
gus in favor O and against the various 
ernatives. Results were consistent with his 
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hypothesis. Both procedures resulted in risky 
shift that differed from zero, but the two 
procedures did not differ significantly in the 
amount of shifts toward risk. Since the group 
discussion appeared to be irrelevant, Bateson 
concluded that the risky shift may be a simple 
matter of familiarization with the choice 
dilemmas. However, Pruitt and Teger? con- 
ducted two similar studies and were unable 
to replicate Bateson's results. They found no 
evidence that familiarity with the items, by 
itself, produced a risky shift. 

On the other hand, Flanders and Thistle- 
thwaite (1967) did obtain results similar to 
Bateson's. After preliminary measurement on 
the hypothetical dilemmas, each subject was 
assigned at random to one of four conditions. 
Subjects in the first condition worked indi- 
vidually, familiarizing themselves with the 
items on the assumption that they were going 
to discuss them later with other subjects; they 


were asked to concentrate on the important 
lists of “pros” and “cons.” In 
groups of three subjects 
discussed the items, according to the typical 
group discussion procedure. To determine 
whether discussion would add effects not al- 
ready induced by familiarization, subjects in 
a third condition both familiarized themselves 
with the items and participated in three-man 
discussions of them. Finally, in a control con- 
dition, subjects merely filled out the ques- 
tionnaire twice. There was a significant shift 
d risk in all conditions except the con- 
the familiarization pro- 
] as much risky shift as 
the discussion condition or the discussion-plus- 
familiarization condition. The authors con- 
cluded that group discussion adds nothing to 
the effects of familiarization with the items; 
that is, discussion is the occasion for fa- 
miliarization, but it is not a necessary condi- 
tion of the risky shift. In other words, by this 
account, the risky-shift phenomenon is a 
pseudogroup effect. 

Kogan and Wallach (1967c), however, have 
questioned the results of the familiarization 


issues, making 
the second condition, 


towar 
trol condition, and 
cedure alone produced 


4 Pruitt, D. Gs and Teger, A. L Is there a shift 
toward risk in group discussion? If so, is it a group 
phenomenon? If so, what causes it? Paper presented 
at the annual meeting of the American Psychological 
Association, Washington, D. C., September 1967 à 
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both conditions, but the Shifts under the dis. 
cussion condition wer 


experiments may be due to subtle differences 
1m procedure, In thei 


tions, Flander 


at would be revealed in à group 
condition, Subjects 
d cons of the vari- 
under the impres. 
choices would re. 
iliarization condi. 
hifts toward risk, 


but they were 
heir subsequent 
main private. Neither fam 
tion produced significant s 


ut the traditional group discussion condi. 
tions did, Teger et al. have conducted ad. 
ditional tests of the familiarization hypothesis 
with nonsupporting results. 


In the light of this evidence, it seems fair 
to conclude th 


at the familiarization hypothesis 
cannot account for the risky-shift phenome- 
non. Blank’s findings (1968) add further 


iment 
weight to this conclusion. In this exper Pa 
subjects placed bets for upcoming b from 
dice over many trials. Bets were selecte hig 
a table of odds on which outcomes bue E 
probabilities had low winnings ens "hig 
outcomes with low probabilities ha ed the 
winnings attached. Subjects either € 
betting game alone or in the simple, ubjects 
active presence of others. Since all nd simi- 
were exposed to the same materials an differ- 
lar choices, there should have been no d yet, 
ences in degree of familiarization. An pres- 
over trials, subjects who played in the - 
ence of others came to place more 


d Jayed 
Probability bets than subjects who P 
alone. 


Tur LEADERSHIP HYPOTHESIS 


+. gisk 
According to the second hypothesis ose 
takers are perceived as group as risks 
who are initially more inclined to ta ‘enti 
also tend to be more dominant and influ shi 
in the group discussion; hence, the ot ente 
can be accounted for in terms of the 1 Collins 
of risky leaders, Marquis (1962) and t this 
and Guetzkow (1964) have argued tha 


8 he phe 
is the most likely explanation of t 
nomenon, 


In one test 


al. (1962) asked group members to rank @ ne 
other for degree of influence exerted aget 
Sroup discussion. There was general S X | 
ment as to who had been most influent? he 
Support of the hypothesis, group nee 
with high initial levels of risk taking iscus” 
perceived as more influential in the di ua 
sion. However, this study is not an eee 
test of the hypothesis, since the outcome ends 
simply reflect the fact that the SOUP oul 
up at a riskier Position, prompting £ 
members to attribute greater influence, v 
those who initially favored higher risk tak! 


ly more 
;eRardless of whether they were actually 
influentia], 


n et 
of this hypothesis, Wallac ach 


a 2 
that the group reath d 
or the relationship 


i h 

.. Roger Brown (1968) has completed a thoroute 
discussion of the leadership hypothesis, with pars 
ar emphasis on general Secial-psychological issu 


p 


l 
l 


l 


; 
i 


NÉ by discussion without consensus, 
to be e ers judged the higher risk takers 
nce ET forceful in the group discussion. 
Am oe however, group members may 
a risk, Soned tat those who initially favored 
fluential position must have been more in- 
pori ore adequate test of the leadership hy- 
and Doo undertaken by Wallach, Kogan, 
akers ar (1968) who reasoned that if risk 
e bn generally persuasive, they should 
Even wh TOESMHALVE than most group members 
i have nothing 


n discussing issues that 
whose mem- 


0 h 
das with risk taking. Groups, a 
eve] pee widely with respect to their initia 
Sion or risk taking, were formed for discus- 
ternatiy du some of which contained al- 
fter TE that were balanced for riskiness. 
askeq he discussions, group members were 
P Sed to rank one another for (4) degree of 
(o in the group discussion, (b) ($4 
Provision of the best ideas, and (¢) greates 
io, sion of effective guidance to the discus- 
ons, ONE men, high risk takers eee 
fie. more persuasive by any of es 
tained 2 although the relationship was ob- 

Evi oe women. 

the fe i from other 
8a si ership hypothesis c 
Shirt t ajor explanation of th ed 
Oward greater risk taking, let alone its 
cause, Wallach, Kogan, and Burt (1967) 
groups, homogeneous as to the field de- 
E ence or field independence of their vies 
choig eet for discussion to consensus o i 
Brou dilemmas items. The two kinds A 
Tis ME did not differ with respect to var 
lage levels, both field-independent inn 
risk eee groups shifted toward gm = 
ind Tias following discussion to consensus, 
Magn’ risky shifts were of about the same 
"Enitudes, Tn addition, high initial risk 
e "n were perceived as more forceful by 
ee subjects but not by eo 
prodest subjects. The important point Tor 
Shift a significant risky 
but there 
k taking 


studies reveals that 
an hardly qualify 
the group-induced 


Sole 


a 


nt purposes is that 
Was was found for the latter group, 
eval © relationship between initial ris! 
Not — perceived influence, ‘This result could 
fie] ve accounted for by the possibility that 
eld-dependent subjects are unable to detect 


differences in the exertion of influence, for 
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members of field-dependent groups showed 
significant within-group agreement in their 
rankings of group members for relative force- 
fulness in the discussion. 

In a related study, Kogan and Wallach 
(1967d) varied groups in terms of mem- 
bers’ test anxiety and defensiveness levels. 
Despite the fact that the groups shifted to- 
ward risk, there was no significant relation- 
ship between level of initial risk taking and 
degree of perceived influence in the discus- 
sion. For groups composed of high-defensive, 
low-anxiety members the reverse was even 
true: low risk takers were slightly more in- 
fluential than high risk takers. Once again, 
the result could not be attributed to differ- 
ential ability of group members to detect de- 
grees of influence. 

Brown (1965) reported a study by Nord- 
who constructed situations on which 
in the opposite direction, that 
on. When the group members 
ndicate the more persuasive 
named those 


hoy, 
groups shifted 
is, toward cauti 
were asked to i 
members on these items, they 
who had been more conservative, not more 
risky, in their initial positions. Rabow, Fowler, 
Bradford, Hofeller, and Shibuya (1966) found 
similar results. These findings suggest not that 
risk takers are more persuasive but that 
group members attribute greater persuasive- 
ness to those individuals whose initial views 
were closer to the final position of the group. 

A study by Edwards and Willems (personal 
communication, July 1970) questions the 
leadership hypothesis. If the risk-prone per- 
son is naturally influential in group discus- 
sion, his influence should be exerted whether 
he argues for a risky position or a conserva- 
tive position." Edwards and Willems asked 
persons who initially had risky positions to 
put aside their real views and argue instead 
for the conservative positions, while persons 
who were initially conservative were requested 
to argue for the risky positions. Group mem- 


è Rim (1963, 19642, 1964b, 1966a, 1960b) suggested 
that high risk takers are characterized by a tern 
oí personality traits that predispose them to leader- 
shin and influence. Some of the traits of high risk 
ws are high in extraversion, Machiavellianism, 
1 for achievement, tolerance of ambiguity, radi- 
d interpersonal values of leadership and 


tak 
neel 
. an 
recognition. 


L D. 


likely to shift toward 
by initially conserva- 


ever, a significant shift 
dition, 


* can on] 
hypothesis is major contender among 
explanations i 


risky-shift Phenomenon, 
The finding that high risk take 


Y conclude that the 


leadership 
not a 


S shift toward Breater risk even 
When 'gh risk takers do not exert greater 
bersuasiveness than low risk takers, (5) that 
It is easier i 


other Processes than leaders} 
for the risky-shift phenomenon, 


Tur Dirruston-or-Responsintrrty 


Hyporursis 


In its 


tS present form, the diffusion-of-re- 
sponsibility hypothesis emphasizes that dis- 
cussion (a) 


Produces e 
tween members anq (b) 


from full responsibility 


motional bonds þe- 
frees the individual 
for his later decision 
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sson has 
because he perceives that his decision 
been partially shaped by the group. nsibility 
The concept of diffusion of Mein d 
is appealing because of its ape t 
contexts. It has been used to exp The in- 
risky actions of mobs and crowds. niversal- 
dividual’s temporary impression ib or mo 
ity among the members of a crow onymity 
and his Perception of a guaranteed an ;erform 
is used to explain why individual Lo such 
inhuman acts in a group setting wh ide ! | 
acts would only bring disgust outs 
setting. rator? 
Pura Wallach, and their collabo 
have carried sedon dig 
they support the aye 
responsibility hypothesis. In one obabilit 
(Wallach et al., 1964), stakes, the apes 
of achieving these Stakes, and group r under 
bility were varied to reach a met risky 
standing of the factors involved in = threes 
shift. Subjects, Who meet in groups ld be 
were told that their main task woul 


5 ta 
answer 10 multiple-choice questions q that 


incorrect ones, docere ia 
Subject haq to determine beforehand bue 
ficulty leve] for each of the 10 quet 5 
Would answer, The more difficult the ve for 
tion, the more money he would receiv 

à correct answer. 


f 

vel 0 

After the subjects had chosen the a ex- 
difficulty for Problems 1-5, the follow for 
Perimental int 


: «een diced 
€rventions were introdu 5 
ES ‘ In a cont 
the remaining five questions, Ir £o wor 
condition, Subjects merely continued 
individually, 


on: 
Subjects wer 
the first five 


iti 

Tn the Broup-decision condi 
e to discuss difficulty leve che 
Problems until they had mis in- 
SToup consensus, but eac each 
l eventually have to solve for 
Problem by himself, In the Paions Ty the 
sroup condition, Subjects were told tha te 
would actually be atten? ine 
determined member o tw? 
» but the Winnings of the other out 
be entirely dependent upon the | to 

his Performance: each subject yer 

llty level of the five ae y 
ore the solver for each was random e 
designated, and the solver would receive t 


M AS 
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pro 
Pena the level he had selected. Ina 
Subjects Se eres condition, 
mous deci ere to discuss and reach a unani- 
e TN concerning difficulty levels of 
is condita to be randomly assigned later. 
Only in s ion differed from the previous one 
Finally Rae there was a group discussion. 
Stoup im a group-designation condition, the 
or each only determined the difficulty level 
à repres problem but also was to designate 
Ach , o ehtative who would attempt to solve 

T Problem, 

di ig lis showed that the control group 
Biss, ee from each of the other 
Shift o ntal conditions. A significant risky 
t occurred in the decision conditions where 
Cision e Pad to reach a unanimous group de- 
ber, Th at would be binding on each mem- 
à Proble, designation of an individual to solve 
CUrred em had a different effect when it oc- 
about i connection with a group consensus 
Such the problem than when there was no 
jects Consensus. In the latter condition, sub- 
ES actually became more conservative. Ap- 


arı R A : 
ently, in the absence of the discussion to 
ay of know- 


Con, 

s Mines 

in, DSUs, the individual had no w À 
e other 1n- 


y 
divi a what level of difficulty th 
duals would prefer to work; the burden 

e allure rested squarely on the shoulders of 
nse dividual decision maker. Hence, NUS. 
Others wus occurred when responsibility i 
addit was created without opportunity Tor 
e lonal communication. However, when 
ditt? decided on a particular level of 
"m d and then a member was chosen, 
Probie” or by the group, to solve a. gre 
mie nig was a maximal diffusion o 
hej nsibility. In this case the individual was 
" id personally responsible. for choosing 
ing ifficulty level nor responsible for decid- 
ieee to work on a problem. If he was 
Would to work on a problem and failed, there 
ad be the consoling fact that the decision 

* hot been his alone. 

Su "he second study, Bem et al. (1965 
ogie T to participate in a study of physio 
mente effects on problem solving. Six experi- 
ino hes described. to three subjects at a 
othe: the subjects being separated from each 
vol r by partitions. The six experiments, in- 
Warne oHactaty stimulation, chromatic stim- 
ation, movement, taste, audition, and sensi- 


) asked 
ysio- 


A. 
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tivity to odorless gases, were concerned with 
the effects of various physiological stimula- 
tions on the ability to solve simple verbal and 
mathematical problems. It was explained that 
subjects would be randomly selected to ap- 
hese experiments during their 
If a subject was selected to 
e of these six experiments, 
he could specify the extent to which he was 
willing to undergo painful temporary side ef- 
fects so as to obtain more money. The more 
probable a side effect, the more money he 
would receive, except that he would be ex- 
cluded from the study if he actually suffered 
the side effects. The payments ranged from 
$2.80 if the subject chose the alternative that 
had a 1096 probability of producing side ef- 
fects to $25 if he chose the alternative with 
a 9096 probability. Subjects choosing the 
abilities obviously had made risky 


pear in one of t 
second session. 
participate in on 


latter prob 
decisions. 

After the subjects filled out the risk-taking 
questionnaire, the partitions between subjects 
were removed, and they were requested to 
discuss each of the six experiments and reach 
a unanimous decision about the type of stim- 
ulation they would endure if the group were 
chosen to participate in a subsequent experi- 
ment as a group. Group decisions were sig- 
nificantly more risky than the mean of the 
decisions made by the group members as in- 
dividuals. Furthermore, the private decisions 
of these same subjects, obtained after com- 
pletion of a group discussion, had also shifted 
significantly in a risky direction. Responses 
in other conditions in the experiment showed 
that the mere anticipated public disclosure of 
personal decisions did not lead to a risky shift 
and that subjects became more conservative 
when they anticipated undergoing the con- 
sequences of the decisions in the company of 
others or when they anticipated discussion to 
consensus. The authors concluded that the 
latter findings clearly support the diffusion-of- 
responsibili y hypothesis at the expense of 
alternative explanations. 

Wallach et al. (1967) systematically com- 
groups of male adults in terms of field 
dependence-independence of their members 
Although both field-dependent and field-i n 
dependent groups shifted toward greater risk. 
taking after discussion, longer discussion 


L posed 
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esis ÍS 
The diffusion-of-responsibility eg the 
risky shift, while longer di i Strongly supported in these be which 
Li 1 introduction of personality var 1s own con 
presumably reflect the individual's In every 
cern about personal pne id to shif 
Case, subjects who — diffuse 
toward risk when responsibility w cts at the 
did show Sreater shifts than su con- 
ng group members other pole of the relevant oer respons 
sponsibility, On tinuum. Nevertheless, [pe fe variance 
the other hand, long discussions by field-in. bility cannot account for all o therwise ¢ 
uld emphasize the cog- in the risky-shift Studies, for o t subject? 
» Which, according to  fensiye Subjects, field-independen 
ot be conducive to the 


ot have 
and subjects with low anxiety would <i di 
tional ties necessary for shown the Significant risky Us sonaia 
iffusion of responsibility to take place, In occur, In short, the work oF a directly 
addition, the authors found 4 Significant DOsi- variables that is supposed to ste 
tive Correlation between the amou 


5 
"nns hypothesi 
*Y — from the diffusion-of-responsibility hyp oth- 
a field-depende: 7 


3 at hypt kv 
does not unequivocally support ye the risk? 
esis as the exclusive determinant o 
he was 4 member, The 


ints 
aa : hat p 
shift. There is additional evidence t some 


‘ 1965) 
for one’s ac Improving on Wallach and Kogue't CA in- 
tions, No Such relationship Was found for the design, Teger and Pruitt (1967) con : condi- 
fie d-dependent Subjects, who presumably formation exchange with discussio gave 
are not prone to minimize à sense of Personal tions. In the 

responsibility for their actions 


former condition, apiece 
formation about their P Fiscus” 
Various issues without s were 
in the latter, the i foun 
discussed at some length. The authom ition 
significant tisky shifts for both esit in 
anxious. ]oy, defen- though the Shift was considerably g 


each other in 
risk levels on 
ing the Issues; 


: zoga 
defensive high the discussion-to-consensus condition. es d 
8^ anxious-hign de and Wallach (1967c) contended that P 
ensive Ignificant risky shifts were found in i 

for all » Dut the magn 


roup 
à : $ in the 

shift Varied, with anyi g for the greater risky shifts found in the £ 
most and Sive subjects, the least, The i 

authors? ; ion j 


s 0 
discussion condition, However, the eee: 
hat homogeneous Teger and Pruitt’s study show that Lge. 
groups o anxious individuals are uniformly is not necessary for a significant shi 1 Pruitt’ 
fearful of failure: Discussion of the choice risk to occur. The results of Teger and 
dilemmas Makes these individuals more wil]. 

ing to qi 


information. 
use responsibility in an effort to 
relieve the b 


: 9a) an 
replicated by Clark and Willems (1969 
urden of possible Personal failure, by Willems 
In contrast, ho 

individuals 


ality 
S Ned : nsibilits 
Furthermore, the diff ple ger 
"ypothesis fails to Predict the high 
others for fear of exposing tj 
personal Weakness 


4 
A i see Footnote 
tion that Pruitt and Teger (see ong items 
es? This orientation inter. und between the initial um receive 
feres with affective interchange among group and risky shift. Items that initially 
members ang hence ten 


‘tt. 
: st shif 
5 to inhibit diffusion — the riskiest Judgments produce the mo 
of responsibility, 


ERE esis 
he diffusion-of-responsibility hypoth 
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Wo 
xs have to predict the opposite, since the 
ihe uS Hebi are mostly those in 
Negative q tisky alternative has important 
19695) . Pago (Clark & Willems, 
Very Es initial responsibility should be 
Should bo ed felt on these items, and there 
Donsibility P Cate effects of diffusion of re- 
ma, le ae Kogan and Wallach have not 
develope icit the relationship between the 
pes of emotional bonds, formed dur- 
e Penn and the negative consequences 
CUssion “alles decision. They stated that dis- 
Proport; ows each member to feel less than 
failure e n to blame in the event „of 
Ed Ccause he sees others as contributing 
Subject ae with him. How this makes the 
Sequence ore willing to risk the negative con- 
es Hs of the decision remains unclear. 
= discussion allow subjects to avoid 
Somehow ee negative consequences? Does n 
3 negati essen the subjective probability © 
Ve outcome? Or, do the negative con- 


Que; Y 
ig actually come to be viewed more 
th Xelye Kogan feels that a discussion of 

rity 


e 
Situations lessens the perceived seve 
* negative consequences (N. Kogan, per- 
pitice Munication, April 1969). However, 
SUch ; al research is needed to establish that 
M the case, 
ee mS and Clark (1971) and Pruitt and 
dence (1969) have found disconfirming evi- 
lishn for the relationship between the estab- 
d s ead bonds among em nad 
e shift. Willems and Clark varied re 
CY of the content of discussion orthog- 
oe across conditions of information ex- 
syp; S€ and discussion. The obtained shifts for 
i ares wa either exchanged risk-relevant 
bien M or who discussed risk-relevant 
Bh, dite significantly from zero; such 
Who xs shifted more toward risk than those 
o a AR exchanged irrelevant information 
Hu sed the irrelevant. propositions. Since 
Ssion of irrelevant propositions should 


Dro 

d à 

uce emotional bonds among group mem- 
a signifi- 


Sona] 


Onal 


Bane RO since this did not result in à 

iu shift, Willems and Clark feel that this 

iore ar the diffusion-of-responsibility hy- 
esis was refuted. 

oe could argue that these experimental 
angements did not really test the second 


part of the diffusion-of-responsibility hypoth- 
esis: the implication of others in the final 
decision. However, a betting experiment con- 
ducted by Pruitt and Teger (1969) found no 
risky shift in a group decision where the mem- 
bers of the group had engaged in past discus- 
sions about other issues but were not per- 
mitted to discuss or exchange information 
about the current issue. In this experiment, 
satisfactory conditions were set for the crea- 
tion of emotional bonds among group mem- 
bers, and responsibility of the group for the 
decision was achieved by informing each 
that group decisions would be the 
average of the individuals’ decisions. Each in- 
dividual would have to face the positive or 
negative outcomes of this averaged decision. 
No shift was found even when there were emo- 
tional bonds among group members and when 
others were implicated in the group decision. 
Also, Dion, Miller, and Magnan’ found in 
disagreement with the diffusion-of-responsibil- 
ity hypothesis, that low cohesive groups ex- 
hibited greater risky shift than high cohesive 
groups. At the least, this finding indicates 
that the creation of emotional bonds is not 
necessary for the risky shift to occur. The 
authors, however, do see the finding as con- 
sistent with a diffusion-of-responsibility ex- 
planation. They suggested that as group mem- 
bers become more attracted to each other, 
they also become less willing to diffuse per- 
sonal responsibility onto other group members 
for the negative consequences of a risky ac- 


member 


tion. 
Although the diffusion-of-responsibility hy- 


pothesis seems to have been refuted in ex- 
riments involving the choice dilemmas or 
betting, it should be pointed out that it 
may have explanatory value in other con- 
texts, for instance, the behavior of bystanders 
in emergencies. Darley and Latané (1968) 
subjects to overhear another subject 


pe 


caused hear 
appear to suffer an epileptic seizure. More 
subjects from two-person groups responded 


om three-person groups, and 


with help than fr 
than from six-person 


more from three-person 


:Dion, K. L. Miller, N., and Magnan, M. A. 
Group cohesiveness and social responsibility as de- 
terminants of the riskv shift. Unpublished manu- 


script, University of Minnesota, Minneapolis, Minnc- 


sota, 1969. 
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groups. Darley and Latané argued persua- 
sively that the diffusion-of-responsibility hy- 
propriately accounts for by- 
than the often acclaimed ex. 


, Te 
g all 


because they 
"individual's responsibility 


diffused by the presence 
221]." A 


do not see how the « 


ormer, other 


People serious The different 


results in the two types of situations are 
Intriguing and should be Systematically 
studied.’ 

Notwithstanding the a 


apparent intuitive ap- 
usion-of-responsibility hypoth- 
usefulness in accounting for re- 
bystander intervention, 
and Lata 
tween emotional b 
bility; furthermore, 
is conceptually unn 


peal of the diff 

esis and its 

search on it shows 
8 Darley 


né do not assume 
onds and 
the creati 
ecessary 


a linkage be- 
diffusion of responsi- 
on of emotional bonds 
to account for the be- 
havior of individuals in emergencies. 
"An excellent Summary of the authors’ work on 
bystander intervention in emergencies is provided by 
Latané and Darley (1969), 
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: e 
serious shortcomings when applied be S 
risky shift because: (a) Contrary to for the 
pothesis, discussion is not necessary js can- 
risky shift to occur; (b) the hypothe e by- 
not account for cautious shifts; Oe that 
pothesis cannot account for the fin hich the 
initially risky items are those for Xhel is 
greatest shift occurs; (d) the wg mos! 
inconsistent with the finding that ve per- 
shift occurs with those items that a ences} 
ceived to have the least serious ca the 
(e) the hypothesis does not specify subjects 
Creation of emotional bonds among  eoative 
makes them less concerned about the anally, 
consequences of risky decisions; (/) pe the 
most damaging of all, it appears R the de 
exchange of relevant information, re neces” 
velopment of emotional bonds, that short f 
sary for the risky shift to occur. In $ factor 
Seems rather clear that some other ember? 
in the communication among group ™ 


ift toward 
is responsible for producing the shi 
risk, 


m ;SIS 
Tur Risk-As-VALUE HyPOTHES 


. retar 
The most persistent and popular qu er 
tion of the risky shift (Brown, 1965; -—- 5. 
& Schneider, 1969; Teger & Pruitt, make? 
Wallach & W ing, 1968; Willems, 1989) is * 
the key assumption that moderate ES 
Stronger cultura] value than caution an 
individuals come to view themselves He risk: 
at least as willing as their peers to en and 
When a group of such persons mee whos 
discusses matters of ri those tha? 
initial private position ive 
the group average will 
Cautiousness and will 
mend greater risk than 
restore their 
tively risky. 
tion of the g 


hat 
eing 


sk taking, : 
S were less risky à 
recognize their ps 
subsequently re o 
before in ad 
berceptions of themselves 25 une 
On this view, the principal rouP 
Toup discussion is to allow £O, e 
members to Compare their positions 0n 
55sue to those of other people. -sons 
ere is considerable evidence that Re jing 
Perceive themselves to be at least as we 
SS their peers to take risks. Hinds" 85% 
Male graduate Students in industrial marag B 
in o Binds, W., c. Jr, Individual and group decisio" 
™ gambling situations, Unpublished master’s the)’ 
“an School 


sti 
of Management, Massachusetts In 
tute op Technology, Cambridge, 1965. 
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Ment to try to guess for two choice dilem- 
im What alternatives would be chosen by 
subje people like themselves. He found that 
cie very consistently guessed that oth- 
the E choose more cautiously than had 
One * jects themselves. Brown (1965) asked 
One indergraduate class of 30 students and 
memara seminar of 16 to answer one 
then te ilemmas problem for themselves and 
Selves © guess how other people like them- 
at ^ ca respond. Not one subject. guessed 
imself ers would answer more riskily than 
icating Brown interpreted this finding as m- 
Self tob that each individual conceives him- 
is eae at least as risky as the average of 
a Senerality of this finding is affirmed in 
(ogg towing studies. Wallach and Wing 
found 3 using six of the choice gsm. 
them that both males and females perceive 
T Wes to be more risky than their peers. 
ringer and Schneider (1969) found the 
1 © results for 9 of the 12 items. Willems 
made the same inquiry in the case 
819, Single risk-taking item and found that 
9 Of his subjects perceived themselves to 
tisks least as willing as their peers to e 
"eplic Villems found the same results in 
Cation of this experiment. Thus, the data 
te “quivocally reveal a strong and pervasive 
the, Mey by persons of both sexes to Sew 
De "USelves as being at least as risky 25 their 
ers, 5 


rom the risk-as-value hypothesis, how- 


e A s 
Er Jt would follow that those subjects who 
t Ceive themselves as more cautious than 

| risk fol- 


Du Peers should not change towarc i 
d group discussions. Clark, C rocket, ai 

Dari er (1971) tested this proposition by co 
hg discussion groups whose members per- 


E tele 

thee’ themselves as being at least as risky as 

Cir peers with groups of subjects who per- 
cautious than 


iud themselves to be more i 
th. ^, Peers. The results showed that only 
5 Ormer group shifted significantly from 
Brou; the magnitude of shift in the DE 
Bron did not differ from that in a contro 
Cong. Thus, Brown’s second assumption was 
Burr. ed, for instance, persons who actually 
Ceiy, €d toward risk were the ones who per- 
ie themselves to be at least as willing as 
ir peers to take risks. 


er 
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According to the risk-as-value hypothesis, 
the main function of the group discussion is 
informative; it makes clear the level of risk 
that others will assume in a given situation. 
Bem et al/s (1965) experiment on aversive 
consequences was designed to test this hy- 
pothesis. Instead of responding to the choice 
dilemmas, subjects in a psychophysiological 
experiment were asked to select the experi- 
mental treatment they would endure from a 
set of treatments that varied in their likeli- 
hood to involve disagreeable physical side- 
effects, The investigators reasoned that in 
this situation, risk taking would connote the 
valued properties of courage and boldness 
and that a risky shift should result when the 
subjects anticipated that their decisions would 
be revealed to others. In one condition, sub- 
jects were told that each person's decisions 
would subsequently be made public in the 
others’ presence; in a second condition, sub- 
jects were told their decisions would be avail- 
able to each other so that they could discuss 
them and make a unanimous selection on just 
what stimulation they wanted to be employed. 
The investigators felt that the social desira- 
bility of risk taking should be particularly 
enhanced under these circumstances because 
the subjects thought they would have to de- 
fend their positions in a subsequent discus- 
sion. However, no evidence of shift toward 
risk was found in the first condition, and a 
conservative rather than a risky shift in the 
second one. The authors concluded that the 
shift to risk in group decisions cannot be 
attributed to a social value on greater risk 
However, the relevance of this study 
value hypothesis must be called 


s Brown (1965) pointed out, 
both conditions who thought 
their decisions would be made public were 
not actually informed of one another's de- 
cisions, and since each subject may have al- 
thought himself bold and venturesome, 
d for him to become more 


taking. 
to the risk-as- 
into question. A 
the subjects in ! 


ready 
there was no nee 
risky. 

A more adequate test of the role of in- 
formation was provided by Teger and Pruitt 
(1967), who compared groups that discussed 
the choice dilemmas with groups that merely 
exchanged information with one another about 
their initial choices. By the risk-as-value hy- 
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pothesis, the exchange of information should 
be enough to show many subjects that they 
are not as risky as most others in the group; 
those subjects should then shift toward greater 
risk. This, in fact, was what Teger and Pruitt 
found. Similarly, Kogan and Wallach (1967c) 
found that listening to a tape recording of 
an actual group discussion led to increased 
risk taking. In this study, as in the former 
study, information was made available that 
permitted the subject, in the absence of dis- 
cussion, to compare his own risk-taking posi- 
tion with the positions advanced by others. 
Nevertheless, in both of the studies just cited, 
simple exchange of information was not as 
effective in promoting the risky shift as was 
the group discussion; this suggests that the 
risk-as-value hypothesis by itself does not 
account for the phenomenon. However, if one 
assumes that the discussion condition provides 
experiences that enhance the value of risk 
taking, then the centrality of this hy 
can be defended. The discussion condi 
only provides information concerning each 
individual's position on the issues but it also 
presents each individual's arguments in de- 
Tense of his position, making the objectively 
less risky person more aware of the value he 
and others place on risk (Teger & Pruitt, 
1967). 

In support of this ir 
(1967) found that sub 
group discussion throu 


shifted toward greater risk in an amount that 
did not differ significantly from the shift 
observed among subjects who actually partici- 
pated in a group discussion, Í 
Kogan and Wallach’s (1967c) listening 
condition and Lamm’s (1967) observation 
condition might also be interpreted as sup- 
porting the diffusion-of-responsibility hypoth- 
esis since the groups did, after all, engage in 
discussion, Tt is possible to untangle this 
theoretical issue by initiating experiments in 
which one of the hypotheses makes a pre- 
diction that the other one does not. Vidmar 
as done such a study. Three rela- 
y Mogeneous sets of groups, consisting 
of high, Medium, and low risk takers, were 
compared with mixeq groups composed of two 
high, one medium, and two low risk takers, 
All four types of 8toups showed the risky 
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pothesis 
tion not 


iterpretation, Lamm 
Jects who observed a 
gh a one-way mirror 
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shift, but the mixed, or heterogeneous P 
showed significantly greater shifts than o- 
more homogeneous groups; the thre - 
geneous groups did not differ fuu ift. 
from one another in magnitude of risky S 
Vidmar also found that the magnitude is a 
dividual risky shift was inversely relatec a 
the member’s initial risk position in the LK 
That is, the objectively conservative mem! S 
in a group shifted more than did the ier 
members, exactly as the risk-as-value hye A 
esis predicts. It should be pointed out Ai 
the familiarization hypothesis and the ana 
sion-of-responsibility hypothesis cannot a 
count for the twin facts that the hen 
neous groups shifted more toward risk eee 
the homogeneous groups and that the he 
tively conservative subjects shifted more 
the objectively risky members. oge- 
Willems and Clark (1971) varied homoE" 
neous and heterogenous groups of igmales d 
thogonally across conditions of inform? jce- 
exchange and discussion, using the six die 
dilemmas items that had produced the ierit 
risky shift in previous studies. The Bes 
for group compositions were as follows: jan 
three critical items (4, 6, and 7 by we 
: son 
and Wallach's numbering) each perso » 
the homogeneous groups did not eva 
more than two scale points. For DNO a 
stoups, at least two persons differed yee 
least four scale points on each of the an 
items. Under both information-exchange eous 
discussion conditions, only the heterogen af 
groups gave shifts significantly greater g xe 
zero. These findings were interpreted P uP 
flecting the fact that in heterogeneous 8! sub” 
the probability was increased that some yere 
jects would discover that their choices i ^ 
not as risky as their peers, producing s in 
tion to shift, In the homogeneous grouP two 
which subjects differed by no more than per 
scale points, there was no real need ior, was 
Sons to shift toward risk because nobody wert 
less risky than his peers; all subjects 
already actualizing the value of risk. n and 
The results of the study by Willems os? 


Clark differ in one important way from ig 
reported by Vidmar, While both son jil- 
nificant shifts in heterogeneous groups: gift 
lems and Clark did not find a significant ê js 


. H ps: 
toward risk in the homogeneous groul 
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Ear did. The difference may be due to a 
iB = difference in procedure. Vidmar formed 
E of high, medium, and low risk 
10 E y summing each subject's scores over 
items, hor dilemmas. Summing scores over 
Spons owever, can conceal diversity of re- 
and a on certain key items, like Items 4 
Teire € the Wallach and Kogan question- 
un ince these items seem to be responsible 
UM of the shift found in a variety of 
Sons 5, Willems and Clark argued that per- 
similar] homogeneous groups must respond 
ceed on those key items. Otherwise, ap- 
eh y homogeneous groups may, in fact, 

T eterogeneous. 

pum is empirical evidence that supports 
Mu enHon, Using the same criteria a5 
Iv (1970), Hoyt and Stoner (1968) 
homoge significant shifts toward risk in their 
shift Seneous groups, while the magnitude of 
cantly 1 these groups did not differ signifi- 
our from that in previous studies in which 
ot were randomly composed and, there- 
same aime to be heterogeneous. At the 
cant ime, Hoyt and Stoner found a signifi- 
i wee relation between dispersion of 
hag dual preferences on an item and the 

nitude of group shift. They remarked: 


"Thus ; 
th 5 it appears that homogeneous grouping reduced 


E n sane 
each anges of total individual risk scores within 
(by almost three- 


four P very substantially ; 

Very“) but did not reduce individual item ranges 

Stong, PPreciably (by about one-seventh) [Hoyt & 
^ 1968, p, 281]. 

Dus 
Othe tt and Teger (see Footnote 4) cited 
ion ce supporting the risk-as-value 
Wine thesis. They compared initial risk level 


in, “Mount of risky shift across the items 


a : 
tor, Iber of studies and found a Very high 
an «ation between the initial position on 


Ti tem and the risky shift for that item. 
tisk, © Judgments on items that elicit a 
tisk, léSponse initially become even more 
"auto after discussion; items that elicit a 
Cay US response initially become even more 


tii ) 
arp OUS after discussion. Pruitt and Teger 


Valye >, that this finding favors the risk-as- 
Aer, "Ypothesis, on the proposition that the 
dicates 


Mhetpe initial position on an item in 
vn E the value of risk or the value of cau- 
Win S appropriate to that issue. Clark and 

™s (19695) also found a very high cor- 
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relation between the mean initial responses 
to the 12 items of the Kogan and Wallach 
questionnaire and the national norms of risky 
shift for these items. 

Further compelling evidence for the risk- 
as-value hypothesis comes from a betting ex- 
periment by Blank (1968). Subjects chose 
bets in either a group or an individual situ- 
ation. Under individual conditions, subjects 
chose a significantly greater number of high- 
probability bets than did subjects in the 
group condition. More importantly, subjects 
who transferred from the group condition to 
the individual condition did not differ sig- 
nificantly in the riskiness of their bets from 
subjects who continued in the individual con- 
dition; however, subjects who transferred 
from the individual condition to the group 
condition showed a significant shift to greater 
risk. This study clearly indicated that the 
risky shift occurs in games of chance, that it 
may occur even when there has been no group 
discussion, and that it does not endure when 
subjects are switched to an individual condi- 
tion.!* Since no discussion was allowed in the 
groups, Blank asserte 


d that the diffusion-of- 
responsibility hypothesis could not easily ex- 
plain this finding. 


The most attractive feature of the risk-as- 


value hypothesis is that it can be easily ex- 
tended to account for items on which judg- 
ments consistently shift toward either risk 
or caution and for items on which no con- 
sistent shift in judgments is evident. If an 
issue invokes values favoring the risky alter- 
native, then the hypothesis would predict that 
most individuals would consider themselves 
to be at least a5 willing as their peers to take 
risks, so that the effect of group discussion 
would be to produce à shift toward risk. For 
those items that elicit cautious values, the 
hypothesis predicts that individuals will per- 

o be at least as cautious as 


i t 
ceive themselves 
their peers and we would expect a shift to- 
ward caution as à result of the group discus- 


sion. Stoner (1968) has performed a study 
that confirms these predictions. He presented 
his subjects with a 12-item questionnaire in 
which 6 items were assumed to elicit risky 

d Vallach (1967a) reported several 
Ex choice be were the shift 
did endure after discussion. 
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values and 6 to elicit cautious values. Sub- 
jects also completed a value-rating instrument 
that required them to rank in order of im- 
portance 18 phrases that reflected values that 
were implicit in the outcomes of the 12 
dilemmas items. On the basis of the subjects? 
rankings, the 12 dilemmas items were di- 
vided into two groups: those for which the 
risky alternatives implied values that were 
ranked higher in importance than the cautious 
alternatives and those for which the 
was true.** Subsequently, 
items in groups and arriv 
cision. Results generally 
as-value predictions. Sub 
initially on risk-oriented items and more 
cautious initially on cautious-oriented items. 
Significant differences between individuals 
perceptions of their own and others’ riskiness 
were also found. On risk-oriented items, sub- 


jects judged themselves to be relatively risky 


and, after unanimous group decisions, gave 


Tesponses that were more risky than the aver- 
age of the initial individual decisions. On 
caution-oriented items, however, the results 
were less clear-cut, Subjects perceived them- 
selves to be more cautious than most people 


X cautious-oriented i 
tems, twi 


opposite 
subjects discussed 
ed at a group de- 
Supported the risk- 
jects were more risky 


tems; 
© showed nonsig- 
r the direction of caution, 
while the third showed nonsignificant judg- 
ments in the direction of risk. In addition, 
cautious-oriented items 
shifts, the other four 


of the importance of 
sky and cautious items 


: : Olve one of the major 
problems with the risk-as-value hypothesis, 


CE Fd risk-taking items do involve 


the importance Of risky 
can be determined, then pe 
of distinguishing between 


values and if 
and cautious alternatives 
rhaps we may have a way 
T = those items that produce 
a shifts and those items that produce risky 
tae Pet tisk-as-value hypothesis would not 
eben GE tie, p planation since subjects’ per- 
3 N relative riski ji 
Jie consiston: with an € riskiness would tend to 


da good predict, 
Sequent group shift toward either caia, 6 st 
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issues 
that of distinguishing in advance pec pt 
on which a shift to risk will occur E 7. addi- 
associated with a shift to caution. on 
tional distinction can be made betwe Charl 
oriented and caution-oriented race wx 
and Willems (1969b) have suggeste ach sie 
perceived consequences of failure in pu ihe 
ation play a crucial role in determina 
level of risk subjects will take in -— P — 
ation. They asked subjects OE i chose 
the hypothetical person in each situa it failed. 
the attractive alternative and that 1 serious 
Subjects were asked to indicate ed ratings 
the consequences would be. The mea. re then 
of the consequences of failures we ner 
correlated with Pruitt and Teger's be the 
norms for the likelihood of risky shifts; 


H 7 

highly 
two sets of ratings correlated very oreatest 
(r= —.70, p< 01); that is, pan least 
risky shifts occur for items with 


ition, 
serious perceived consequences. In ag tons 
the mean riskiness of subjects’ initial y^ per- 
on the items was correlated with Phitially 
ceived consequences: those items that Jativé 
elicit risk are perceived to have hos elicit 
trivial consequences, those that initial y A 
caution are perceived to have severe 


t 
ugges 
sequences. At the least, these data SU97 


ail- 

: E of fa^" 

that subjects? perceived consequences ^, pisk 
J I d 


ure play a part in their proneness towa" 
and caution, ‘able co? 
Taken together, the evidence availa of risk- 
verges to suggest that (a) treatment which 
relevant content, (b) in situations for eve 
the consequences of failure are -— hei 
(c) in groups whose members vary a wh 
initial riskiness, and (d) for individua Ying 
Perceive themselves to be at least B3 jresent 
as their peers to take risks jointly mot he 
the necessary and sufficient conditions four of 
risky shift. This is so because when all occu? 
these conditions are not met, the shift ó the 
(sufficient conditions) and when iy es 108 
four conditions is not met, the shift nm of 
Occur (necessary conditions). Since ME di- 
the conditions (except Condition b) othe 
rect derivatives of the risk-as-value BYP! that 
sis, the evidence is 
view 15 


" r 
strong support fo 


*8 Jellison and 
pleted 
Festing, 


om^ 

recently €^ 4o 

Riskind (1970) have eee oh L 
Studies that relate risk-as-value 
ers theory of Social comparison. 


GROUP SHIFT TOWARD RISK 


Current STATUS OF THE RISKY-SHIFT 
PHENOMENON 


E x maa shift toward risk using the 
m deg emmas as a measure can be shown 
(2) Rea * least four main characteristics: 
ward Pes a and systematic average shift to- 
ences. ; (5) the convergence of risk pref- 
no indi ae the ceiling effect, that is, that 
Preferenc ual's risk. preference exceeds the 
group ce of the riskiest individual before the 
Dönder RB occurs; ** and (d) the pre- 
iene Den of comments in the discussion 
s risk (Brown, 1965). 
Eo Post hoc basis, the familiarization 
tematic pue seem to account for the sys- 
aspects shift but none of the other three 
s Not only does the familiarization 
Ypothesis fail to account for the convergence 
Donde" the ceiling effect, and the pre- 
Riven oa of comments favoring risk but, 
dictions e hypothesis, one cannot derive pre- 
9f the ri oe these key characteristics 
acteristics ky shift, This is because these char- 
amiliariy, refer to group processes, and the 
Shift ph zation hypothesis regards the risky- 
Non, ique n as a pseudogroup phenome- 
the origi addition, recent studies show that 
es Sinal familiarization findings cannot be 
Dlicated, 
hs i hoc basis, the leadership hypoth- 
Suasibii its emphasis on the leader's per- 
Systema c?" would appear to account for the 
rence shift, the ceiling effect, the con- 
nd th around the opinion of the leader, 
tisk * preponderance of comments favoring 
View, owever, on an empirical basis, the re- 
from E studies has shown that derivations 
dict ana leadership hypothesis do not pre- 
nd account for the phenomenon. 
Tespon à post hoc basis, the diffusion-of- 
Count sibility hypothesis would seem to ac- 
the for the systematic shift and perhaps 
cusgi Donderance of comments in the dis- 
Sr E favoring risk, but not the convergence 
m WS or the ceiling effect. The former two 
220 hr authors ressar, among 1,920 events 
ng Ee X 6 Choice-Dilemmas Items) only 57 
bere than ee final decisions that. =~ Ln 
Poss; the ‘py initial positions of the ris Us € 
efje Pe oute oups. In other words, only 4% of the 
t. omes violated this assertion of a ceiling 
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characteristics can be derived from the hy- 
pothesis but not the latter two, since it is 
possible for each person in a group to become 
more risky without changes in variability oc- 
curring as a result. Since the hypothesis em- 
phasized generalized group processes, a pre- 
diction concerning the ceiling effect cannot be 
made. In fact, one might wonder why, as a 
result of diffusion of responsibility, the final 
decisions are not riskier than the riskiest 
individual. Although the hypothesis does pre- 
dict the systematic shift toward risk and the 
preponderance of comments favoring risk, ex- 
perimental studies reported above, especially 
the ones involving relevance of content, per- 
ception of self as relatively risky, and diver- 
sity of opinion, suggest that diffusion of re- 
sponsibility as a postulated process is not 
necessary or sufficient to produce the shift. 

In contrast to the other three hypotheses, 
on a post hoc basis, the risk-as-value hypothe- 
sis would account for all four aspects of the 
risky shift. The hypothesis predicts: (a) The 
systematic shift toward risk occurs when 
persons are not, in fact, expressing positions 
at least as risky as the norm; (5) the con- 
vergence of risk preferences around a central 
value is the result of value attainment; (c) 
the reason for the ceiling effect is that persons 
want to be at least as risky as their peers, 
and the fact that most individuals move to- 
ward the riskiest individual achieves this; and 
(d) the preponderance of discussion favors 
risk because the discussion of risk-relevant 
material allows the individual to compare his 
level of risk taking with that of others. Fur- 
thermore, the experimental studies reported 
above, particularly the ones in relevance of 
5 heterogeneity of initial opinions, and 
riskiness, demonstrate that risky 
under the conditions specifically 
he hypothesis. Finally, this hy- 
] advantage of being 
tious as well as for 


conten 
perception of 
shifts occur 

predicted by t h 
pothesis has the potentia 
able to account for cau 


risky shifts. . 
e troublesome problem is the 


However, on " a : 
sky shift on instructions. 


dependence of the risky shift 
The stability and replicability of the risky- 


shift phenomenon have been firmly estab- 
lished. In fact, most investigators begin by 
assuming its stability and have been search- 
ing for explanations of the shift. The two 


si 
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most popular interpretations, the diffusion-of- 
responsibility 
both point to 
the risky shift, 


However, there is evidenc 
periments by Clark and W; 


ents on this topic. 
red by the Kogan 
mas instrument, the 
in the research on 


Risk taking was measu: 
and Wallach choice-dilem: 
format most widely used 


were 
information-ex- 


15 The finding that risk-oriented instructions gen- 
erate conservative rather than risky shifts, or do 
not Cause shifts, for Materials of certain kinds, is 
not a serious contradiction to an instructional de- 
mand characteristic interpretation of the risky shift, 
but it does suggest that instructions are not the only 
factor that generate shifts. Wallach and Mabli’s 


"E 
Since risky shifts have occurred in auod 
magnitudes from different d for à 
other concepts have to be formula isky-shift 
complete understanding of the ri 
phenomenon. " ction 
Furthermore, the findings on bam 
dependence do nothing to distinguis account 
the two major hypotheses now used to nsibility 
for the risky shift. E R ol Jowest 
proponents might suggest that lient in 
probability instruction becomes san > of less 
discussion and enhances the -— À 
potential blame, The sep car vial p a o 
would probably suggest that the pi ve in 
Operate to persuade subjects to mi s. How 
direction of the more risky ups ie con- 
ever, the findings do suggest that s ns must 
ceptual and methodological e = arene w 
be placed on the range of generale ly, 
can make about the phenomenon. bn exer” 
the findings suggest that care must wal 
cised in interpreting the Kogan eo taki 
questionnaire as a measure of ris bá 
Until further research is conducted invo 
struction dependence and risk choices pe e; 
ing other experimenta] formats—for e~e ralize 
betting with Payoffs—we cannot ae inr 
instruction dependence to them. A uns 
Portance of such research should be e ructio? 
These data suggest, then, that ins nce: of 
dependence must be added to releva conse 
Context, in situations for which the eneilY 
quences of failure are not severe, asain f 
of opinions and distribution of percep When 
peers as a condition of the risky shift. strae" 
those conditions and risky-oriented e e 
tions occur together, the risky shite d 
Obtained; when any of the four eon I ud 
not met, it seems, the shift will not p 
Until recently, this seemed to be the aised | 
ut, as noted below, new research has T 
Some new and troublesome issues. , with 
One difficulty, as discussed earlier, rigi- 
the risk-as-value hypothesis as it was © no 
nally formulateq by Brown is that it pen 
account for the discrepancies found E 
Coo HM ol 
jew that Clark and Willems’ neutral x 
encourage a wider range of poite e 
force toward compromise and cone ce: 
ignores the finding that there were no different 


jented in^ 
etween the risk-oriented and neutral-oriente 
structions in initial responses or variability. 


lach 
ng 
in 


" 
Inform 4 
ion Biei Exchange, listening, and discus- 
EE ons. deger and Pruitt (1967), 
Ypothesis gued for a modification of Brown’s 
Condition ' pointing out that in the discussion 
Stoup m there is greater opportunity for 
the ea to acquaint themselves with 
ark bese in favor of the value of risk. 
Ment rely, al. (1971) performed an experi- 
Jets eith ant to this issue. They had sub- 
their Bade. (a) exchange information about 
Css the a on the risk dilemmas; (5) dis- 
| each i ilemmas and indicate their choices 
| hoiceg. item and their arguments for those 
| but Sy eae discuss the choice dilemmas, 
* vari give their arguments in favor of 
Vhic ee alternatives without specifying 
Bt natives they had chosen. In both 
cantly conditions, judgments shifted sig- 
Bteater uq oneta risk, in amounts significantly 
tition ses in the information-exchange COn- 
E ich did not differ significantly from 
With his Tesult appears to be inconsistent 
Wisk is e risk-as-value hypothesis, for since 
Ea k value, and we perceive ourselves to 
tisk, „ast as willing as our peers to take 
bosi Sh should need to reevaluate our own 
others ns only when we actually discover that 
: ye hoose riskier alternatives than we do; 
ieu, 5 significant shift occurred in the 
fhe "a condition when numerical prefer- 
EN anged, aL alternatives 
EY assu The hypothesis can b 
Pati, Ming that listening 
vii tisk in a discussion of argume 
fach ^ ven without clear commi 
usgan bject to an alternative, perm 
than i» to learn that his peers 2 
Üh had thought. There is € 
tha, Previous study, however, t 


subj 1 
Sth ubjects did change their percept 
ific alternatives. Su 


nts i 
tment from 
it the dis- 
re riskier 
vidence from 
hat suggests 
ions of 


«hers 
. lesponses to spec! 
fter discussion of 


lec 
the indicated before and a 
Roe dilemmas what they thought the 
Mark Y of people like themselves woul 
“Ussig for each item. Subjects in both dis- 
M the groups—with and without exchange 
th, € alternatives chosen—came to change 
tiski judgments to represent other people as 
No ex than initially; the two conditions did 
differ significantly. Thus, the data do 
ediation of the cultural 


Sy 
v,SBest that the ™ 
igh the specification of 


ue of risk throt 
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enable." However, a more 


probabilities is still t 
ded to substantiate this 


adequate test js nee 

view. 
Nevertheless, th 

change leads to soci 


e idea that information ex- 
al comparison processes is 
still a troublesome possibility. According to 
the risk-as-value hypothesis, the risky shift 
occurs because the individual discovers that 
he is not as risky as his peers; in order to 
maintain the implicit norm of riskiness, he 


becomes more risky. Recent evidence, how- 
ever, suggests that risky and cautious shifts 
do not occur when social comparison processes 
and elicitation of arguments in favor of risky 

are not biased 


or cautious items, respectively, 

toward the implicit values of the decision 
items. In a pilot investigation, Steiner (cited 
in Dion, Baron, & Miller, 1970), using both 
risk-oriented and cautious-oriented items, 
presented false feedback that disconfirmed 
‘ects initial expectation concerning the 


the subj 
responses of others. The manipulated norms 


produced marked shifts of own riskiness in 
the direction of the position advocated in the 


false feedback. 


Baron, Dion, and Baron" conducted a 


similar study. Subjects discussed either risk- 
oriented items. Using con- 


oriented or caution- 
federates, the investigators created a majority 
consensus that was either two scale positions 


riskier or more conservative than the naive 
subject's initial decision. Subjects exhibited 
marked conformity to the artificially estab- 
lished group consensus, regardless of whether 
the group norm coincided or conflicted with 
the underlying value elicited by the type of 
item. Furthermore, these shifts remained in- 
tact in private, posttest measures of risk 


taking. 


Clark and Crockett (1971) had low, me- 


dium, or high risk takers listen to contrived 
tape recordings of discussion groups com- 
posed of low, medium, or high risk takers 
on six risky items. The results were striking. 
16 Madaras and Bem, (1968) piéssitéd ndine 
that they interpreted as indicating that cultural value 
of risk is mediated by the knowledge OF others arts 
ments, for instance, information dou Fisk alterna 
ea po dd pei Banm P. Group 
aking. Unpublished 


norms, uiu i 
manuscript, University of “Minnesota, ‘Minneapolis 
Minnesota, 1968. í 


268 


RUSSELL D. CLARK, III 


Subjects always conformed to the position 
advocated by the taped discussion groups, 
regardless of whether the positions advocated 
coincided with the underlying value of risk. 

These results do not argue well for an 
interpretation that assumes that social com- 
parison pressures operate in conjunction with 
elicited values. If risk an 
cultural values, individu 
adjust their levels of 
coincide with the do 


d caution are Strong 
als should selectively 
risk and caution to 
minant values under- 
lying risk-oriented and caution-oriented items, 
Yet, the previous Studies showed that sub- 
jects conform dramatically to the majority 
consensus provided them, Thus, it appears 
that the social comparison mechanism of 
value theory fares Poorly, 


Evidence from a recen 
this 


composed three-person groups 
d either a conservati 


Were in a majority, 
toward caution whe 


ity. Procedural € groups 
unt for the dis. 
Crepancies in results, Wallach and Mabli did 


not directly manipulate scale position in their 
an item-by-item basis 
tigators. In addition, 
sky subjects had the 
e the more cautious 


We can onl 
culturally di 
risky ar, 


Y conclude that individuals are 
Sposed to generate and favor 
guments when considering risky items 


idering 
and cautious arguments when c 
cautious items (Brown, 1965; Ma i er 
Bem, 1968), but when forced to d risky 
cautious arguments and judgments we on 
items and risky arguments and NU their 
cautious items, individuals do mi the 
judgments in the direction qoom be- 
arguments to minimize the E av these 
tween themselves and others. Whet expos 
results hold up when an individual is d arg! 
to more heterogeneous judgments P oriment 
ments (persons in the above e tha 
heard only judgments and m ue their 
were homogeneously discrepant EI is € 
own), selected so that the individual ! nily 


- slig! 

posed to views that are the same tt = E 
different from his own as well as put for 
are very different, will have to ^ furth 
further experimental analysis. Such stand, 
work is important because, as p 
these results raise questions about pene 
value hypothesis and the risky-shift p. 
non itself, d termine 

Future research is needed to de neces- 
whether modifications of the above tion? 
Sary conditions are needed or if ad nt or 
conditions have to be found to accou 
situations where socia re 
and elicitation of arguments that = risk 
biased toward the implicit values of kw arly 
taking dilemmas, At present, it is partie ment’ 
unclear just what distribution of pee 
(percentage of risky and cautious argu to 


f H hifts 
must occur for risky or cautious S 
result. 


^ ress 
1 comparison p! 
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PERSONALITY AND ORGANIZATION 
F SOME CONCEPTUAL LITERATURE 


A REVIEW O 


CARY M. LICHTMAN + 


Wayne State University 


Since a theory of organ 
about the human personality, 
different theories have dealt with, 
personalistic” dilemma. The seve 
rubrics: (a) traditional structura: 
modern structural approaches (e.£ 
views (e.g. perceptual theories), and 
and role models). I 
and among organizations Ie! 
in understanding organizational 
tional contingencies. 


t is conclude 
nders 


When wi 
| ehavio, viewing the phenomena of human 
k T, it has been a custom among social 


Scient; 
entists to debate the relative importance 


f individual characteristics and proper- 
times they 


accom- 


tes 
ide 
o ma the problem, and, 
tion am have been sough 
longli 4 divides “personalists’ 
School S" despite the efforts 
ollan of “interactionist” Peace’ ection 
and der & Hunt, 1967, especially 9ec ions 
ATI for fuller discussion of these issues) - 
leories of organization: dealing 25 = 
En human collective undertakings, oor 
Qut Y expect to escape the conceptual 
from this “debate. 


k » Nor have they- And 

C i on 

ca Use we believe a theory of organizati 
be no better than t 


he assumptions it 
ak 5 

v about the human personality, © T 
diffe to review and comment on ve fot 
ev, Cnt theories have chosen to Tes? 

ade) the issue of relating 
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tean: 
Sanizations. 


construe organiza- 
(1964), as “collec- 
the pursuit of 
a more or 


we 


Telai: 1 establishe 
[tively specifie objective Saas 
continuous basis à 
'ganizational policies 2? structures may be 
laborate d separate from their member- 
: int should be sent to C. M. 
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etter than the assumption it makes 
views and comments on the ways 
the “structuralistic” versus 
der four conceptual 
and Weber), (b) 
(c) personalistic 
(e.g, system 
treme variability found within 
theories less useful 
nize situa- 


pulate and work them, and 
the interplay of people and structures defines 
the phenomena of organization (cf. Hollander 
& Hunt, 1967, pp. 352-353). 

Early theories of social organization em- 
phasized the more general sociological con- 
viction that the social structure was the 
primary determinant of differential human 
characteristics. Classical organization theo- 
rists, therefore, proposed that these structural 
determinants be utilized in the design of 
organizations to maximize human potential 
and organizational efficiency. Some two dec- 
ades later, however, the so-called neoclassical 
organization theorists offered models based on 
the personalistic views of psychology. React- 
ing to early sociological notions, this school 
of thought rejected the formal organizational 
structure as of small importance and placed 
the locus of the organization within the phe- 
nomenal fields of its individual incumbents. 
Then, reacting in turn to the one-sidedness 
of these two earlier approaches, many modern 
organization theorists acknowledged the value 
of each and attempted to integrate them into 
a unitary systemic conceptual scheme. Thus 
modern organization theory (cf. Scott, 1961) 
proposes that human behavior in organiza- 
tional settings can be understood in terms of 
three elements: (4) the stated design of 
functions, that is, the requirements of the 
organization; (b) the characteristics of peo- 
ple who populate the organization, that is 
the attributes they bring with them into the 


ships, people po 
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organization. 
other social 
tween the 


level emergents and 


"modern? organization theory. 
In their entirety, p 
organization do not al 


thus, they say, « 
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modes of compensating performance c. 
Opsahl & Dunnette, 1966). q 

Modern structural theorists pp 
the same logic, although basing it “that all 
ferent initial premise, They argue. ing 
men are interested in seh for 
realizing their full potential, and to 2 decel” 
this they recommend a looser, more ucrats: 
tralized structure than did the big! he 
Both views are logically cepe. 
position that since people share cer 5eop! 
portant characteristics, differences In Ferms 
or performance can best be explained "in the 
of differential positional occupancy tion 9 
organizational structure, The implica ri 
this typology is that theorists such as 4 well)» 
and McGregor (and some others Personal 
who have usually been considered Pheorists 
istic theorists, emerge as structural 
in the present context. ;,n deals 

The third section of the discussion o 
with those theories that emphasize fing 
sonalistic approach to the understan "m 
organizational behavior, In this cathae I i 
Viewpoints that stress individual cot, 
attributes, human experience, and inc! 
differences as behavioral determinants ‘in 4 
of these assume that employees act dividua 
viduals, while Others assume that indi 98. 
ehavior is mediated by informal group”, 


c 
; aor dire 
either case, however, the bias is in Hie rather ' 
tion of personalism or *psychologism 

than “sociologism” or structuralism. five 


The fourth section deals with pager: 
approaches to organizations, which at in 
to synthesize the various one-sided oe 
a single framework represented to be a ning 
accurate model of organizational functio 


Traditional S tructural Approaches | 


ti 
Probably the most ambitious attempt 
deal with the social-psychological aspec 
the socia] Structure was that of Marx Niro 
1887; or more particularly the collection. 
Writings, 1964). Marx was concerned bet 
Properties of the Social environment as E 
influence the characteristics of people par 
Populated that environment. In HERT 
Social stratification is seen collectively in tem 
orm of ideology; the stratification peri 
is a way of organizing the social or 
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Ide 1 , 
d B hen, is an expression of the strati- 
stratificati ystem and develops after the 
hus 4 n system is established. 
enced aired Marx, the individual is influ- 
important ih by the social order and has no 
tem of niin aracteristics aside from the sys- 
Change ich he is a part. The only way to 
in, ME is to change the society. 
Maintained Marx's lead, Durkheim (1902) 
Vision of that once a society establishes a 
ety can x labor, the properties of that soci- 
The bons defined independently of people. 
ined as acteristics of people can be exam- 
they belo function of the categories to which 
characterize: In other words, people can be 
Toperti tized as a function of the structural 
" des of society. } 
Writinn then, the earlier sociological 
eim o» of theorists such as Marx and Durk- 
Tetical pa be seen to provide à broad theo- 
of ase for classical organization theorists 
oue eA such schools of organizationa 
at as scientific managen 


tratiy, nent, admins 

lire, agemen, and bureaucracy, o- 
lCed Jn the Marxes and Durkheims m ee 
Ede ons of distinctions Hr 
Ose es theory and perse iy a = 
unt ge still tend to elu 

> 1969b). 


Sci 
lenti 
"lific Management 


(9), his | scientific B. 
Mees ) set out to study orgem 
entific precepts.” He wa 
arily in work performance: 


ou 
stu arci de We less energ 1 
Yield more work. This view 3 qum 
" ents, e.g., Gilbreth, 1911): rather, 
en with the worker 25 person; a 
'Otker was seen as an exin the worker 
9 far as the model Was conces V or separate 
tions of this *as- 


is jo 
imply had no life outside PE 
"i his tasks. The im iluminated by Dan- 
iel Sx status" € h is interesting to note 
is ell (1956). ^7. Tayloristic polemics are 
x Passing that man assionate debates about 
Oday reechoed hat was written before 
Computerism.” V 


S 
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t Taylor and scientific management is 
being written again about the inhumanity of 
the computer (cf., e.£.; Simon, 1965). 

In the Tayloristic model, work gets done by 
providing employees with work routines (or 
programs). There is no point, thought Tay- 
lor, in having men work at less than capacity 
because the work is poorly organized (the 
man-machine concept). The assembly line, 
therefore, can be run as fast as people can 
work, and time-method studies can be used 
to discover how fast people can work. In 
other words, experiments could be performed 
to discover the one best method or best condi- 


tions for doing the job. 

Apart from a kind o 
and a trust in monetary 
had no motivational terms 
Motivation was largely assumed as an in- 
trinsic moral matter. Furthermore, the “es- 
sential ability to work” was simply taken as 
given (cf. Neff, 1968, pp. 5-11, for a pointed 
discussion of this idea). But, if motivation 
and ability tended to be treated as intrinsic, 
organizational operation was viewed as de- 
pendent on extrinsic control. 

For Taylor, the organization was an au- 
thority structure to be described in terms of 
span of control. This was the key to organi- 
zational design and created a system based 
on a centralized pyramidal authority struc- 
ture. Taylor's plan was to devise a rational 
structure based on an authority hierarchy, 
with varying support functions (clerical, re- 
search, and development, etc.) where and 
when necessary—the line-staff notion. Thus, 
for Taylor, the guiding principle of organiza- 
tion was the span of control (or the author- 
ity structure), with components laid out in 
terms of vertical line and lateral staff. 

According to Taylor, the organization 
could then be specified in terms of organiza- 
tional charts (authority and line-staff) and 
job specifications. This was the organization 
the formal system. Nothing else nde 
The organization was thus defined in terms of 
scientific principles used to describe the for- 


abou 


f Calvinist work-ethic 
inducements, Taylor 
in his models. 


mal system. 
Confined almost exclusively to the vantage 


point of top management, Taylor's view of 
ganizati 
the organization suffers from a severe lack of 
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conceptual richness, Furthermore, 


Scientific management achieves conceptual closure of 
the organization by assuming that goals are known, 
tasks are repetitive, output from production processes 
somehow disappears, and resources in uniform quali- 
ties are available [Thompson, 1967, p. 5]. 


Administrative Management 


The administrative management views fos- 
tered by Gulick and Urwick (1937) stressed 
the structural relationships among produc- 
tion, personnel, supply, and other service 
units of the organization. Here again the 
emphasis is on efficiency, and many Taylor- 
istic principles are invoked. Efficiency is 
maximized by task specialization and the 
grouping of similar tasks into departments, 
Responsibility is fixed according to such 
principles as span of control or delegation 
and by insuring adequate control over em- 
ployees so that action conforms to plans. 
The assumption in this view is that a master 
plan is known so that Specialization, de- 
partmentalization, and control can be deter- 
mined rationally. (Simon, 1957, for one, has 
criticized this view by enumerating the diffi- 


culties in attaining agreement with and con- 
Stancy in master plans.) 


Like its relative, 
administrativ 
the organizati 


scientific management, 
e management commonly ignores 
ion’s environment and boundary 
cessary to transact matters of 
al survival with that environ- 


We might inter 
vation that revie 


movement is hardly dea 
more often now i 
oriented mana; 
model building, 
benefit analy. 


in 
operations research, cost- 
sis, etc, Moreover, it often dis- 
guises itself today in a more 
tems" terminolo. 
leaving behind 
fications 


comment might be 


g 


d at made about the human 
relations. tradition ( 
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pue lely à 
sometimes is painted as if it ^ n 
manifestation of the 1930s and 19 Gul i 
ever, unlike Australopithicus (or specially 
lives (cf., e.g., Whyte, 1969—note esr 

p. 708). 


Bureaucracy a 

views: 

Similar in many ways to the above weber 
bureaucracy (cf. Mouzelis, 1967; 


as 4 
1947) focuses on staffing and skonta ho 
means of dealing with clients and disp - 
cases—and the achievement of qme 
this view, efficiency is maximized by nd hier- 
ing offices according to jurisdiction z " 
archical position, appointing et work 
fices, utilizing procedural devices over” 
situations, the promulgation of rules b 3 
ing behavior of positional incumbents, 


to pro 
viding differential rewards by vp ad- 
vide incentive and patterns for C homp 


vancement (Quinn & Kahn, sow) theor 
son, 1967). As with other structura prope 
of organization, Weber felt that the cou 
use of rules, rewards, and punishment? i by 
render the human component negligi from 
divorcing the individual’s private oe sig 
his role as an officeholder. Once more facom- 
nificance of individual identity was Uim, 
prehended: Indeed, the belief was wi ie a 
plicit that “workers,” unlike manager 
other "good" people, had none. sation 2 
The static nature of the organiza » elo 
described by Weber inspired numero? pable 
Quent attacks on bureaucratic theory: ? ted bY 
among these were the criticisms sugges deal 
Merton (1957) and Gouldner (1954), of 
ing with the unintended consequent” 2 
bureaucratic rule enforcement. Mer ion? 
Sued that reliance on depersonalized re risdic 
and strict enforcement of rules and ju pilitY 
tions in order to reduce human Var ition 
lead to rigid behavior on the part of po net 
incumbents. This rigidity, based on ptt 
to defend individual actions, leads to € the 
on organizational precedent and use d 
Most easily invoked rule in dealings W! ea: 
ents. Often, however, the rule most Jien 
invoked is least appropriate to the C reat” 
case, The client’s challenge of the pe is 
Cats judgment serves only to E 
drive to defend his actions and thence E 
creasingly rigid and inappropriate beh? 


jes 


Ji- 
silY 
1 
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In a somewhat similar vein, Gouldner 
Ea 54) pointed out an additional self-defeat- 
i consequence of reliance, stemming from 
m ands for control, on the use of general 
ise ene rules. He argued that the 
S ira such rules offers little incentive for 
m yees to do any more than the minimum 
Whe nt of work that will avoid punishment. 
ity s management discovers that productiv- 
on ES fallen behind goals, pressure 15 placed 
serve pervisors to supervise more closely. This 
lation to increase the visibility of power re- 
even s within the work group and leads to 
of th more dogged determination on the part 

T e workers to do as little as possible. 
jg pen (1961) added that bureaucra- 
pervi end to attract monocratic types to su- 
ings ad positions who tend to enhance feel- 
er helplessness and insecurity in subordi- 
Probl, Jasinski (1956) has found even more 
E. with the overreliance on efficiency 

be s: He specified seven: wasted time, 

os “a maintenance costs, figure “fudging, 
unit norale, impaired labor recruitment, higher 
Sede and lowered product quality. 
ing ee has provided an especially amus- 
Mg account of bureaucratic blundering and 
Drie ee in the United States De- 

hie tee State ee his book A Thousand 
Scathi (1965), and Bell (1956) has penned a 
tratie c indictment of the Tayloristic-bureau- 

Oth cult of efficiency.” F 
19 a writers (e.g. Argyris, 1964; Bennis, 
Ho Haire, 1963) have objected to bureau- 
m ond because it is based on the as- 
ue that man is lazy and not to be 
Vine > They further object to the accompa- 

we eer notion of the intrinsic goodness 
Dlicit ; and the punishment orientation im- 
t in classical theory. These critics agree 
stin, *Ssical models of organization Serve to 


Cra. 


ca] ., Yorkers in their quest for “psychologi- 
Success » 
As s 

tha, TS (1964), for example, has argued 


t I f 
ca] „° the degree that the dictates of classi- 
the Bu. are followed in an organization, 
ie Wer level employee will suffer because 
W ac 
of 1 — 

E ex 2 abilities will be used . - A 
os E a sense of dependence and submissive- 
E a hue his superior because decisions of whether 
° on, a works, how much he shall be paid, and 

Will be largely under his superior's control 


. [hel will tend 


. [and] the worker will tend to experience a 
decreasing sense of selí-responsibility and seli-con- 


trol [pp. 38-39]. 


Although most of the critics of classical 
organization theory view bureaucracy as either 
an ideal type or an undifferentiated lump, 
Hall (1963) has shown that the major dis- 
cussants of bureaucracy cannot themselves 
completely agree on à list of definitive char- 
acteristics. Furthermore, in an empirical 
study, Hall demonstrated that organizations 
that are highly bureaucratized on one dimen- 
sion may not be so on another. The dimen- 
sions studied by Hall were hierarchy of au- 
thority, division of labor, system of rules, 
system of procedures, impersonality, and em- 
phasis on technical competence; other varia- 
bles have been dealt with by Woodward 
(1965), Perrow (1967), and Harvey (1968) 
with comparable results. Clearly, then, bu- 
reaucracy exists in degrees, and there are very 
few so-called bureaucracies conforming in all 
ways to an ideal type. Nevertheless, most of 
the critical writing in this area tends to view 
it as an all-or-none phenomenon. The con- 


clusion to be drawn is that classical organiza- 


tion theory is often used as a straw man or 
figurative foil for both critical virtuosos and 
theorists advancing alternative models. 


Modern Structural Views 

Sharing the same general rationale of or- 
ganizational design as the traditional struc- 
tural theorists—that the organization must 
be designed to fit the nature of people—the 
modern structural theorists, by altering their 
assumptions concerning people, have proposed 
alternative designs to bureaucracy. Instead of 
assuming as did classical theorists, that all 
workers are motivated by rational economic 

modern structuralists undertake to 
he worker as à total human being 
for self-improvement, self-expression, 
autonomy, recognition, or whatnot. It is fur- 
ther assume that these strivings exist m 
equal amounts in all people but that their 
expression depends on the degree to which 
the social structure 15 able to accommodate 


such strivings- 


striving 


The model used by this approach is nicely 
Merton’s (1957) reformula- 


exemplified in ) 
tion of the concept of anomie. The problem 
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for Merton was to explain why the frequency 
of deviant behavior varies in different social 
Structures. He viewed the task as one of locat- 
ing groups which typically engage in deviant 
behavior (or maladjustive behavior from the 
societal viewpoint) as normal responses to the 
social situation in which they find themselves, 
Specifically, 
cultural axiom: 
ends of succes 
by virtually all 
He defined su 


S with the 
especially for in- 


in the Organiza. 


adi es Such as mini. 
tivity, sabotage, low morale, and 


Mos S 
knowledge the exi mur 


x Xistence of i 
ences, but they proceed to bui 
as if these differ 


ences are of no account, 
Influences on these views 


The immediate 
Test with the Hawthorne studies (discussed 
4) theoretical for- 


later) and Maslow’s (195 
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ed 
1 ES i nt ne 
mulations emphasizing man’s inhere 


B re 
to use his capacities and skills in a "e 
and productive way. Maslow specified Je an 
hierarchy common to all normal wr an 
consisting, in ascending order, of m E 
survival needs, social needs, ego dee Hee s. 
tonomy needs, and selfactualizätion until 
A higher-level need cannot be satisfie e been 
the needs below it in the hierarchy Bave i 
served. Maslow argued that it is univ js an 
true that all manner of human per 
attempt to work upward in the ructur 
Using this basic notion, modern str in 
theorists feel that since membership men’ 
work organization is a central fact P ranizi" 
lives, it is to the best interest of the eras j- 
tion and the people in it to alter the human 
zation’s structure to allow for the 
quest for self-actualization. a view Ë 
Among the major advocates of this remisi 
Argyris (1957, 1964), whose chief P quire- 
was that there is a lack of fit between i om 
ments of bureaucratic organizations dividudl 
mal Structure) and the needs of in cce" 
members to achieve “psychological SU 
Argyris wrote: 


d 
se tow?" 
our selection of the individu mp : ! 
Psychological Success is not 

There is an incre 
who believe that self-esteem, self-acceP 
Psychological success are some of the EU 
factors that constitute individual mental we 
Our culture. If we are able to understam y ind 
how one may enhance the opportunity that po 
vidual Psychological success, we believe peal 
will contribute toward 
[1964, pp. 36-37] 


al who p i 
itrapv ONE “atte 
an arbitrary ;cho git 
number of Psy v ani 
tanci entr? 
jn 


asing 


enta 
individual men 


J 

workers at the lower end of the organ s 
hierarchy, as opposed to those at hig! 
els, suffer from poorer mental healt di pr 
Job Satisfaction, lower levels of sel z vati? 
eelings of insecurity, and other relate 
bles, The 
Subordination, and passivity on trat et 
lé Workers can only result in “frus nfiie 
failure, short-term perspective, an 

964, p, 40].” ton that 2 

Further, Argyris took the position ren d 
Organization Operated on classical empl; 
suffers inefficiency to the extent me d i 
ees waste energy by engaging in activ! 


5 


a compulsive, defensive nature. Thus, there 
is less human energy available for genuine 
Productivity. As an outline for a solution, 
Argyris proposed a “mix model" in which 
Changes in the organization toward increas- 
ing opportunity for psychological success and 
Self-esteem be made as long as we are able 
to show that they are decreasing the unpro- 
ductive compulsive activities [1964, p. 147].” 
Argyris is quick to point out that this does 
not necessarily imply a people-centered or- 
Sanization because self-esteem is strongly 
ae with responsibility to others. F urther- 
re, organizational changes designed to ac- 
Commodate psychological success should con- 
tinue only so long as the organization can 
More efficiently achieve its objectives, main- 
tain its internal system, and adapt to its 
Environment. i 
aperis view, then, contains a powerful 
ced in the reality of the forces exerted on 
‘dividuals by an organizational structure. 
aes Personality is considered a constant, 
in — differences can only be explained 
and rms of differential positional incumbency, 
<a, Indeed, the lower organizational levels 
jj; em as the most restricting with respect 
uman need satisfaction. 
we lcGregor (1960) embraced a similar ue 
Sin he proposed “Theory yy as a genera 
ution to the problem of organizational im- 
wm ement, Theory Y represents a set ie as- 
h Qn dn about human motivation, m = 
ser, YS need hierarchy, which he felt shou 
Mec. as a basis for organizational design. 
€Gregor argued: 


S 


A 
bove all, the ory Y 


" ie Int ùp 
he f. assumptions of The point ul 


“ct that the limits of human collaboration 1n 
"ganizational setting are not limits of human 
but of management’s ingenuity in discover- 


how à T 
h m 9W to realize the potential represented by its 
eas, l'esourc inagement an 


as Du 

ia organizational 
Or H 

teso mance: It js due to the nature of human 

the 68 with which we must work. Theory Y, on 


implies that the 


Ve : = 
Causes i uncooperative, Theory Y t 
ganization 


a ie i : : 
nd issu ba methods of or 

Th ` . 
ot : 
haie Words, the structure of the organi- 
lS responsible for difficult employees, 
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and an alteration in this structure is called 
for. 

Other theorists have, in whole or in part, 
fostered a similar point of view. Likert (1961, 
1967) did so by arguing that “System 4” 
(trust and participation) is the only leader- 
ship style which can yield maximum effective- 
ness. Gellerman (1963) posited competence 
and power as general motives and suggested 
that the structure take these into account. 
And Blake and his co-workers (cf. Blake & 
Mouton, 1968) also proposed a “one best" 
route to managerial effectiveness—the 9-9 
theory. 

The solutions proposed by the several writ- 
ers holding to modern structural viewpoints 
suggest reordering organization away from 
bureaucratic designs so that they will allow 
for participative decision making, face-to-face 
work groups, mutual confidence between su- 
perior and subordinate, job enlargement, in- 
creased responsibility at lower positions, and 
so on. 

The modern structural theorists have met 
with criticism on two main grounds: Their 
global personality assumptions and their pro- 
pensity to blame the structure of the organi- 
zation for all of its ills. Examples of the first 
type of criticism have emanated from the pen 
of Strauss (1963, 1968). Strauss felt that the 
notions of such people as Argyris (1959), 
Herzberg, Mausner, and Snyderman (1959), 
Maier (1955), Maslow (1954), and Mc- 
Gregor (1960) are laden with an inner- 
directed academic bias in which righteous 
professors impose their own values on mem- 
bers of population segments who may have 
quite a different idea of self-development. 
J E Galbraith (1967), however, felt that 
modern structural theorists seem to feel that 
if workers gain most of their satisfactions 
away from their jobs, then the organiza- 
tion has somehow failed in both efficiency 
and elementary humanism and must, of neces- 
be restructured. Strauss (1963) con- 
tinued by asserting that the injudicious use 
of power-equalization practices may actually 
adversely affect the security needs of those 
workers who are not prepared for the added 
responsibilities that accompany an expanded 
role. Indeed, à closely related phenomenon 
figured prominently as a source of disturbance 


sity, 


ET 
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in Levinson, Price, Munden, 


Solley’s Midland Utilities st 
Further 


b 


Mandel, and 


udy (1962), 
> power-equalization techniques may 


than they are 
tuations where 
or keeping up 
hat is required 


EY and organi- 
zation). Moreover, like their traditional 
counterparts, the modern structuralists are 


Fiedler, 1965), 
Sayles and Strauss 


Ployees who do highly 
skilled or Creative work jn large companies 
(e.g., research and q ienti 


revert to a production 
System based on individual craftsmanship, 


Katz and Kahn (1966) have pointed out, for 
instance, that a 


a good many mass-produced 
automobiles can be bought for the price of one 
olls OYCé. As noted above 
] 


S voiced simi] 


he literature 


ven aspects of organizational 


ere, ave been 

zation being the two possible exception? tations? 
shown to have some kind of significant to bo 1 
to either job attitudes or job behavior, experiment 
these types of variables, However, . "between sm 
"proof" of cause-effect relationships jor is elus" 
ture and employee attitudes and behav 
and almost nonexistent [p. 47]. n the liter 
[T]here are already enough indications br to invest 
ture to support a greater research or properties © 
gate the interactions among Sed Eoia yee as 
organizations in their relationship to evious tb A 
behavior and attitudes, Too much gra neglect 
rizing in the area of organizations s there PE 
such interaction possibilities and hence; gify vast? 
been an unfortunate tendency to oves ions ple 
the effects of particular variables. ipie varia 
Dear to be much too complex for a fect acros 
to have a consistent unidirectional e asl). 
Wide variety of types of conditions [p. contin 

Similar sentiments relative to pa have 
Sent nature of organizational proc ture. APA 
become commonplace in the a ecen 
their pertinence is underscored sani 
Work on determinants of formal 2 structure 
Structure and the impact of forma ickso™ 
on member characteristics (cf., (T espe. 
Pugh, & Pheysey, 1969; Whyte, erties 
cially Chapter 3). Structural gari men” 
Organizations surely “seep into” ne reverse 
bers, as Udy (1965) stated, but t relatio? 
is also true, and, in any case, the 
usually are not simple. ; le 

Other criticisms have also beer 


É ctura 
against the human relations stru 
Pugh 


s 


zatio? 


veled 
jism 
cori 
(1966) has noted that e t 
reacting against the classical schis organi" 
have an implicit bias against € organi 
tion. Leavitt (1962) felt that muc rela jon? 
tional restructuring based on human principle 
Philosophies has been done on ier often 
of defeating Taylorism and has th water 
thrown out the baby with the d over 
Leavitt felt, in particular, that has serve 
Teaction against classical theories 3 of p f 
to impede the salutary ion and informatió? 
ciples from communication and E a. 
theory into organizational design. s again i 
ichtman ? have cautioned oe on tha 
giving their employees the qom mes i 
Productivity is somehow not impot it onl 
the first place, it is not true; second, 


rin” 


d 
ncepts d 
*Hunt, R. G, & Lichtman, C. ep deu 
Strategies for training work supervis University 
counselors, Buffalo, N. Y.: State 
New York, 1969. (Ditto) 


-—. 
eer 
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Serve. 
stress Fe ontse employees and may create 
Orientation ose places were a human relations 
\atzel] (1962 supposed to relieve it. And 
tional or 2) has cited evidence of conven- 
tioning eanizations that seemed to be func- 
E uu well and of human relations 

Boil P were rejected by workers. 

tural th d seem, then, that current struc- 
lions iip with emphases on human rela- 
Coretica] made many of the same errors of 
COrists presumption as did the classical 
Teactin Mor whom they were ostensibly 
Detsonati otably, their assumptions of global 
the Jod characteristics give their theories 
e E n ended, closed system, quality of 
broposi ical theories. They have a Way of 
ci cy A the one best solution to the effi- 
RES ra all organizations and the one best 
| Dot o i^ prn all members. Their view then 
Urect: nly simplistically one-sided, but uni- 
Onal: Effects flow almost entirely from 


way 
1 


the organ; 
cc anlzation's properties to the individual. 
ti Mpared with modern varieties, tradi- 
r, be said 


to qe ueturaliem might, howeve 
ani à somewhat friendlier attitude toward 
p enans and a somewhat less humanistic 
a its members. Traditional struc- 
be SEM tend to see organizations 35 societal 
Gig, tors needful only of augmented effi- 
Organi, Whereas the modernists look upon the 
ita spon as a more oppressive agency 
the p © for reform. In either case, however, 
Su perier prevails that apt structuring of the 
2 ia: oF the organization will allow te 
m ual’s natural inclinations to issue fort 

a. Ore effective performance. And Taylor 
hee convinced as anybody that this would 

n EAR by happier workers.) i 
Ogica] is discussion of "work as à PIET 
ed ^r d Neff (1968), however; ar- 
ho c at it is evident that people exist 
px. not (or will not) work effectively 
tion wi the best of conditions. The ques- 
Person, raised under the rubric of the ‘ work 
logic di is, in essence, What psycho- 
al characteristics are requisite to entry 
a erformance in «work roles"? (cf. also 

S, Lofquist, & Weiss, 1968). 


Org, 


l 


b, 
er. saan 
"sonalistic Views 


Althou 1] theori i 
gh global eories of personalit, 
have been of benefit to oiganiaation ox. 
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management theory—especially their implica- 
tions for organizational design—they are 
often difficult to study in a precise empirical 
manner, and they do not provide sufficient 
differential predictive power to be useful by 
themselves for an understanding of organiza- 
tional behavior. As à result, many students 
of organizations have chosen to concentrate 
their theoretical and empirical efforts on 
the discovery of the influence of more par- 
ticularized measures of personality. Schein 
(1965) commented: 


Man is not only complex, but also highly variable; 
he has many motives which are arranged in some 
sort of hierarchy of importance to him, but this 
hierarchy is subject to change from time to time and 
situation to situation; furthermore, motives interact 
and combine complex motive patterns (for example, 
since money can facilitate self-actualization, for some 
people economic strivings are equivalent to self- 
actualization) . Man is capable of Jearning new 
motives through his organizational experiences . . - 
and... is the result of a complex interaction. be- 
tween initial needs and organizational experiences. 
Man's motives in different organizations or different 
subparts of the same organization may be different. 
...Man can become productively involved with 
organizations on the basis of many different kinds 


of motives . . . [P- 60]. 
Many of the same 


points have been made 
by Hunt (19692) in discussion of the role 
of profit in business affairs. In any event, the 
general assumption of the personalistic view 
is that people react to their organization on 
the basis of their perceptions of it. These 
perceptions are based on people's needs, 
motives, and values. Therefore, to understand 
human behavior in organizations, one must 
understand how individuals differ with respect 
to personalistic variables; to change organi- 
zations, one must alter the perceptions of 
people. The impetus to this view rests largely 
with the writings of Kurt Lewin and the 
Hawthorne group. 

Lewin's field theory and his group dy- 
namics work (cf. Lewin, 1951) prefigured 
more than one important contemporary pat- 
tern. of organizational analysis. The group 
dynamics research, of course, was one of the 
roots of T-grouping; while field theory per 
se constituted one of the early gestures in 

sychology toward a systemlike interpretation 
individual and group behavior. Among 


of A 
for instance, Lewin argued that 


other things, 


4o0U 
action was no sim 


cal Stimulus-response linkages, 


shifting momentary fields of for ^ 
chological environment, the life 


I ^ 1968). The process 
of organizational analysis must then consist 
argely of Personnel asa 


; Organizational 
in the changing 
Se is fundamenta] (if 
implicit) to what may be termed “clinical”? 
ion development (errs, 
T-group Perspectives: 


1933; Roethlisberger & 
were being conducted. Alt 

nally Conceived as a test of Taylorism—a 
Search for the one best level of illumination, 
among other things—the findings indicated 
that there was no one best level, Rather, 
workers Stepped "D production at all levels. 
The Hawthorne Studies became important 
principally because the investigators went on 
to ask wy there Was no relation between 
illumination and output, and the studies con- 


Dickson, 
hough origi- 
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classi- 
ducted to discover the answer led to neo 
cal organization theory. . con- 
From their studies the neo t 
cluded that friendship patterns y advocate 
of organization. They subsequent A iniques | 
human relations as a set of v main- 
Which people could be motivatec ‘nat is pe 
taining that the organization is e case tH 
ceived by the employees to be t tion theory 
Proponents of neoclassical organiza. notions 
abandoned the formal structural 


ten 
* nt sys 
the classical theorists, The importa 


truc 
+ ational S n 
was no longer the formal organica orga" i 
ture, but the informal relations a pee? | 


zational levels. What had cpm 
called the forma] structure, then, W: rkings 9 
but the manifestation of the pee ody $ 
the informal System (or else S " 
idealization of it). abso 
As the indi e researchers voli il not 
ever, the implications for Toe organiza 
lie in any gross restructuring of e: individus. 
tion, but rather in à program O erception” 
counseling designed to change the p' rgani 
workers toward the 1 Fren 
tion. The classic studies of Coch pot fi 
Lewin (1958) were gu. 
Shoots of this line of thinking. B of o 
these studies the persuasive potente as à 
formal group membership was ceptions. 3 
vehicle for Changing people's percer entio! 
the structure as if is; there was e syste™ 
of changing the basic structure of t eople fe 
there was only the attempt to get E came 
accept it. Although this new nt he 
to be called Participative momaka pc 
fact was that management had no changing 
of altering the organization—only ot x: Amon 
People’s attitudes toward hee ne the 
others, Bell (1956) has been voca relation? 
“manipulative” qualities of d (19 
Operations, and recently even Benni 
as commented on this tendency. 


thing 
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sub” 
The Hawthorne researchers ho ha: 
ject to sweeping critique. Carey ( follow? 
argued that their conclusions neither m the! 
from nor were they supportable us exactly 
data; if anything, they RU ove 
What they maintained had [s influenti 
Yet, be that as it may, few 
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Works could pe 


er iound—they ushered in an 


a. B yr 
avthorae Pus (1965) has pointed out, the 
Notion of ec ie led to the demise of the 
actually m, i era man without, however, 
of analysis 3 social organization the focus 
that man B he Hawthorne studies argued 
an Mute motivated by a good deal more 
Worker's pe ry self-interest; indeed, that the 
Man cent ^w group could actually guide à 
‘ablishin Y from his own self-interest by es- 
Ough i^ norm of restricted output. Al- 
Yas emph group factor in work situations 
d System ; asized, and the concept of social 
| Teig, ioduced, an individual focus wes 
View w he function of the group, in p 
the 


B wW 5 
individual to relieve the boredom of 


Utside the The work-group € 
reality € organization by defining 
also <; NOt only of the work situa 
Th the worker’s place in the lar 

. Wüüeq € work group as an informal s 
life out Roy, provides a continuity between 
nera] de and inside the organization. his 
the lit Position has received wide support in 

Fe ime (e.g., Bennis & Shepard, 1969; 

Dan 1950; Walker, 1958)- 
ng ert (1961, 1967) typifies the group 
Mics approach in his advocacy of group 


deg 
cisi 
on processes at all levels. He proposed 
consisting 


tion but 
ger soci- 
tructure, 


, 


oy, for example, would 
e E the work group 
Visor member of the next hi 
mro, level. Thus, the supervisor 
at .P is a subordinate in the next, and so on, 

Successive levels. Horizontal as well as 


er 
n tical linkages are built into this pin 
roups rather 


than emphasis is clearly on B8rOUL" = 
lineo individuals. “An organization will 
no, ON best when its personnel function 

às individuals but as members of highly 
gh performance 
» Likert felt 
decision 
ng to his 


but also à 
gher super- 
jn one 


Ds do work groups with hi 
thus [Likert, 1961, p. 105]. 
Maki employee participation in 
be E when accomplished accordir 
Viso ing pin” notion, will not reduce super- 

ry influence (a view shared by Tannen- 
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baum, 1966), but rather will serve to change 
the attitudes of work-group members t 
better conform to organizational goals. More- 
over, like others of its genre, Likert's solu- 
tions to organizational problems rarely carry 
important implications of a basic structural 
order; indeed, by and large they presume 
a generally bureaucratic system and speak 
mainly to operations within it (cf. Bennis, 
1970). 

Despite contemporary recognition of the 
influence of informal groups on the formal 
organization, many modern theorists have 
preferred to limit the domain of their analy- 
sis to individual experience. Vroom (1964), 
for instance, has recently applied Lewin's 
general theory of personality to the study 
of organizational behavior, hypothesizing 
performance to be a function of the worker's 
perception of the abilities required by the 
job, the degree to which the person perceives 
himself as having these abilities, and the 
degree to which he values the possession of 
such abilities. Vroom, thus, stressed the 
affective consequences of the degree of con- 
sistency beween a person’s performance and 
his self-concept. In this view a person is 
motivated to perform effectively when effec- 
tive performance is consisent with his con- 
ception of his abilities and with the value 
he places on them. Porter and Lawler (1968) 
have similarly hypothesized that managerial 
performance is a function of the perceived 
value of the reward, the probability that ef- 
fort will bring the reward, and the accuracy 
of role perceptions. Smith and Cranny (1968) 
have presented a similar formulation, and 
March and Simon (1958) and Cyert and 
March (1963) have also stressed personal- 
istic variables in their individual decision- 
making models of organizations. They have 
used such expressions as “the perceived con- 
sequence of evoked alternatives,” “the ex- 
pected value of reward,” placing the locus 
of the organization within the individuals 
who populate Ib 

Katz (1964) has posited that the motiva- 
tion to produce and motivation to belong 
are orthogonal dimensions, and this indepen- 
dence helps to explain the corresponding lack 
of relationship between job satisfaction and 
productivity Bass (1965) argued that it is 
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not necessary to study workers with reference 
to any basic needs or tensions; rather, there 
is just as much predictive power to be had 
by restricting research to “organizationally 
relevant” motivational dispositions. Task 
orientation, self-orientation, and interpersonal 
orientation are the three proposed by Bass. 
These and some related ideas have been well- 
summarized by Quinn and Kahn (1967). 

In his book Expsychian Management, 
Maslow (1965) took a more radical view by 
arguing that work in itself has a kind of 
symbiotic quality. Satisfaction can indeed be 
gotten out of good work, but this is not a 
social or socially derived ( ego) need. Rather, 
work is a primitive need independent of so. 
cial need satisfaction and can be intrinsically 
interesting. In principle, Maslow’s view is not 
altogether unlike that of Bell (1956), which 
we have cited, or of Herzberg et al. (1959), 
who have claimed that the only true job satis- 
faction can come out of work itself. Good 
working conditions and satisfactory social re- 
lationships around the job can only serve to 
make the job tolerable. 

A review of the literature concerned with 
studies of individual worker differences was 
written by Smith and Cranny (1968) and 
provides further evidence for the value of 
such research, In sum, recognizing the inade- 
quacies of classical organization theory, many 


he coin and concentrated 
ts toward evaluating the 

d the work group as 
e also encouraged an 

roach to 7 
of organizational behavior, by d 
intraperso: along which people 
may vary. An especially clear illustration of 
this can be found in Hackman’s recent 
(1969) reworking of the Herzberg two-factor 
notions about work motivation. 

But this view is no less one-sided than the 
structural approaches because it has little to 
say about the formal organization. Any 
comprehensive theory of organizational he. 
havior, of necessity, has to integrate the 
formal structure, the processes occurring 
within individuals, and the function of in- 


formal groups (cf, Hunt, 1968; 
1967). 


The solution 


be- 


Perrow, 


» according to Scott (1961), 
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lies in the remembrance that the main pM 
lem with the structural and personali 
views was not one of inacurracy but rat In 
incompleteness and _ shortsightedness. ra- 
neither approach is there adequate -— 
tion among the numerous facets of huma 
havior studied by it. The task of the wu 
organization theorist, therefore, is to ee 
the several views into a more accurate ™ 
of organizational functioning. : the use 
Mechanic (1963) has championed the "i 
of the multivariable or “eclectic” eager 
to organizational studies, as opposed to view 
sided” approaches committed to a single ne- 
of human nature, He maintained gee 
sided approaches tend to encourage : thus 
Partisanship to a particular view ang ation 
engender misunderstandings, communieh, ey 
difficulties, unjustified attacks, while ation. 
also discourage interdisciplinary cooper? iy 
However, if “one-sided” analysis is defined pe 
as a strategy of approach rather than a doris 
assertion of priority, the dialogue between = oss 
may be improved and communication ree 
fertilization may be preserved at the same tinmicis™ 
[While it may be valuable to urge anticis 
in theory building, it is essential to urge ecle 


mos 
i logic 
in the use of research approaches and methodo ani- 
Perhaps the mosi 


r 

t significant recent trend a [^ des 
zation studies is the utilization of various chanicr 
of data collection in the same study [Me 
1963, pp. 172-173]. 


he 

The virtues of the one-sided versus the 
eclectic approach depend, in part, xt to 
Purposes of the theorist, If his purpose ade 
convince bureaucratic managers of e 


H ere 
vantages of human relations practice treme 
1s probably no harm in taking a more € oy. 


Personalistic view of human behavior P ak 
ganizations; presumably, the target formal 
ready familiar with the operations of is 

rules, Tf, however, the theorists intent sode! 
build an accurate and workable general < a 
of organizational functioning, then it 15 sed 
that none of the one-sided views DI? ctio? 
to date can do that job. The next e y 
deals with some approaches to the disc? pal 


of a general solution to the organiza 
dilemma. 


Integrating Approaches 


: myriad 
Increasing disenchantment with the hes E 
of one-sided and normative approach 


m 
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a theory has led many writers to 
RA s use of alternative models. Such 
CR A cá represent attempts to inte- 
ids xisting approaches and/or reflect the 

nce of situational factors on organi- 


zation EIS x 
com functioning. Katzell, for example, 


b 
esting: themselves with certain preselected 
are RON ...and not with others that 
Unproven bes the system, by relying heavily on 
Scheralize assumptions, and by their tendency to 
Tounding rather than paying due attention to sur- 
_ the lus circumstances, such theories typically take 
Y Prescribed si a blueprint or master plan which is 
anag as the one best way to organize and 
ge all work [1962, p. 104]. 


factors must be 


Katzell felt that many 
decided 


vW " hoe account before it can be € 
like " Mie e policies and practices are 
Sügpeste, work best, As a starting point, he 

ir E five such parameters: 
Katz z first parameter mentioned by 
ent or is SEL or the number of interdepen- 
(1969) ganizational members. Hickson et al. 
have also given prominence to size 


fact, 

ors i : e 

Struct, in their analyses of determinants of 
tur, ure, Indik (1963) reviewed the litera- 


bar eating organizational size to member 
r apes in terms of absence and turn- 
ftologie finding which calls to mind Barker's 
yan Cal studies of schools, Cf., €. 1963). 
these ae paper, Indik (1965) interpreted 
zation, "ings by positing that larger organi- 
Comm; contain more potential and necessary 
te e, cation linkages among members, 
cult ng adequate communication more dif- 


fo fo achieve. The result of inadequate 


Municati 
Cte e cation among members serves to de- 
tur interpersonal attractiveness and, in 
r and 


La e (oer participation rates. Porte 
ied, also reviewed the literature on 
igh tional size and concluded that al- 
le satisfaction and morale tend to 
din, T in large organizations, the general 
e relevant to this area are neither 
te, Ds = clear. These authors felt that 
o Minant ubunit size is a more crucial de- 
"ganizat; of employee reactions than Is total 
Dressed pOr size. Woodward (1965). t00. has 
> Stressi a more differentiated concept of 
Tefere, ing the importance of indexing size 
nce to the size of managerial sub- 


oe CE 
TEF pos 
5 6g, 
ES 
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systems. There is clearly a need for more 
research in this area.? 

2. The degree of interaction and interde- 
pendence is another parameter suggested as 
important by Katzell. Katz and Kahn (1966) 
have theorized that the degree of task inter- 
dependence is an important consideration in 
choosing the appropriate authority structure 
for an organization. They argued that where 
such interdependence is high, where creative 
requirements are minimal, and where identi- 
fication with organizational goals is not re- 
quired, a hierarchical authority structure, 
with its implication of close supervision, is 
indicated. Where the opposite circumstances 
obtain, a more democratic alternative should 
be used for maximum efficiency. Vroom and 
Mann (1960) have proposed that differ- 
ences in work-group task interdependence 
account for the choice of autocratic super- 
visors by some groups and democratic 
supervisors by others. 

More generally, Hackman * has shown the 
importance of task structure to patterns of 
group interaction; Sommer (1967, 1968) has 
demonstrated the pervasive influence of 
broader ecological factors in group (and, by 
inference, organization) processes, and W. F. 
Whyte (1969) may be consulted for a wide- 
discussion of various aspects and 
“the structuring of the work 
(cf. especially Chapter 3) 
Hunt (1970) has discussed at length the 
organizational significance of task uncertainty 
and has identified distinctive modes of struc- 
tural adaptation to it. Relatedly, Haberstroh 
(1965) provided a review of the role of in- 
formation systems in task modeling and or- 
ganizational design. He noted, for instance, 
that “the coherence and unity of an organi- 
zation is intimately related to its information 


3 Recently Blau (1970) has pu 
theory relating organization size 
ferentiation that could prove a proe 
such research. He touches on various nuances of 
both dimensions, but his “basic gen zations are 
(a) increasing organizational size & enertes differenti- 
ation along various lines at decelerating ates; and 
(b) differentiation enlarges the administrative com- 
ponent in organizations t0 effect coordination 
| Abstract, P- 201)" — 8 i . 

4 Hackman, J- R. Functions of group interaction 
Paper presented at Colloquium of the State Uni 
versity of New York, Buffalo, October 1969 


ranging 
implications of 
environment” 


iblished a systematic 
and structural dif- 
ductive guide for 
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Therefore, he main. 
ch systems will precipi- 
ational] repercussions,” 
the idea of close and 
f information Systems 
- also Cleland & King, 


1968; Glans, Grad, Holstein, Meyers, & 


personalities of organiz 
motivations, expectations 


ence measures (Smith & 
patterns of i 


1970). 


Cranny, 1968) and 
mmediate supervision (Ronan, 


to be considered as a determinant of organi- 
zational design. Katz and Kahn (1966) dis- 
tinguished i 


jobs through 
- Bell, 1956, for a useful gen- 


Of this issue). Mann and 


and Blauner (1964) have 
: automation has occurred, 
the remaining joh 


na S provide more freedom, 
responsibility, anq dignity, The result is a 


ter 
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zer 
+ worke 
: in w 
decrease rather than an increase 


alienation, ] opostd | 
The second type of anri d Pol 
by Katz and Kahn occurs throug b m 
zation of organizational goals f motivation | 
through the job itself, This type on and Sell 
is associated with value expression tions thls 
identification. In industrial e agree amone 
type of internalization is usually fo volun x 
management personnel, although he rank am | 
organizations it may extend to t a ver, tà 
fle. Katz and Kahn argued, poder Mp. 
Complete internalization of org of p? E 
goals is less common than two types concer 
internalization, The first of wot are 10 | 
general organizational purposes t k ied 
unique to a particular organizatio zd re 
tist, for example, may internalize, 
search values of his profession A 'emp A 
extent that the organization pP E 
him shares similar goals, Pu E. 
Worker and organizational goals WI any 


Era k in 
However, the scientist could wor 
other or 


of 


essionals whose attachments to 
ar organizations are derivative activity: d 
as a simple locus of occupational NU z 
The other subtype proposed by 
Kahn is the internalization of organ" 4 | 
goals and values, Since one's own 08%”, 


* him 7i 
tional unit is often more visible to ar 
i 


i 


N 


but, in general, it is clear that midi | 
and variety of employee nad à 
have a profound influence on Org 

Policy and practice. | 


v 
to do 
5. Katzell's final parameter E. neces$ 
Who in the organization has 


D 


E 


l 


abili 
1 will fore Motivation to take action that 
969) hen objectives. Rubin and Hunt 
Specia] bins discussed this matter with 
Identificati SUCCELO what they called "mission 
ion." After reviewing some of the 


Iterat. 
ur E X 
| thap C 9n supervision, Katzell observed 


conditi to organize and manage work is 
4 ave tl ioned by those in the organization 
Job done, ine knowledge and motivation to get the 
to Mold 'ü ore specifically, it would seem necessary 


"Em he organizational system so that it maxi- 
w] 
Who 


re of action and initiative for those 
ott, C mplian will take effective action, while elicit- 
hers, D ce, support, or noninterference from 
"Xpertise epending on the organizational loci of 
Tem Page dedication, the. appropriate work 
1962, p, im" widely from situation to situation 
pes concluded that “the sheer number 
Organizatie that can be used to describe 
and pp, Ons is staggering [1962, p. 105] 
them E ways must be found to work with 
enti eaningfully (and we have hardly even 
197p Ped technological variables; cf. Hunt, 
S005 may to do this, he proposed 
Tprisingly), is to discover the geno- 
dimensions that underlie phenotypic 
Dlexities, 
1s the recognition of the vast number 
that Complexity of organizational variables 
in em a been responsible for a decided shift 
the, Phasis on the part of many organization 
itu Sts over the past decade or so. Perhaps 
of ,,/More accurate to say that recognition 
Struch, complexity of the facts of social 
Sible pi and social organization was respon- 
Dsyc; T RD belated development of the 
izat gical approach to the study of or- 
the lons. Until perhaps two decades ago 
Social sciences lacked the conceptual tools 
Sota] ang with many important facets of 
introg ife (Katz & Kahn, 1966). With the 
Oben uction of the biological notion of an 
Socja] 15 em (von Bertalanffy, 1950) into the 
Came sciences (Miller, 1968; Morane, 1967) 
the 2 practicable possibility for integrating 
Plethora of one-sided views of social 
Sanizations. 


A 


lY pic 
Com 


Soc; 

| Cial Systems 
a Among the earliest and most generally 
Seful systemic theories of organizational be- 
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havior was the theory put forth by Homans 
(1950). Homans commenced by positing that 
any social system exists in a three-part en- 
vironment: a physical environment (geogra- 
phy, climate, etc.), a cultural environment 
(the norms and values and goals of the 
society at large), and a technological envi- 
ronment (the state of knowledge and instru- 
mentation available to the system). At the 
next level the social system itself has certain 
requirements and goals that are translated 
into specified activities and interactions for 
members of the system. The behavior re- 
quired by the system or determined by its 


environment is called the external system by 


Homans. For example, in a work organiza- 
s a large ele- 


tion, management constitute 
ment of the external system by making 
decisions that bring certain people together 
in particular ways, for instance, through job 
specifications, work methods, prescribed lay- 
outs, and selection of personnel. 

Although the people brought together by 
the management may be strangers at first, 
they eventually come to know one another, 
develop cliques and friendships, socialize on 
and off the job, help one another in their 
work, agree to restrict production, and so on. 
They now have another basis on which to 
associate, one that modifies and influences 
their behavior over and above (and perhaps 
instead of) the effects of the external system. 
The new form of behavior comes about as a 
reaction to the demands of the external sys- 
tem. Homans called this phenomenon the 


internal system. 
Homans further specified that the ele- 
] system can be sorted 


ments of à social 
into three categories which contain aspects 


of both the internal and external systems: 
Activities (the things people do, the 
acts they perform), interactions (activities 
that link people together so that the activity 
of one person has an effect on the activity 
of another), and sentiments (internal psycho- 
logical states, €£: emotions, feelings, beliefs, 
values). Homans postulated that activities, 
ractions, and sentiments are mutually de- 


inte 
pendent on one another so that a change in 
any one will produce a change in the others. 


For example, positive sentiments between two 
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people lead to increased interaction, or vice 
versa. Homans further argued that the inter- 
nal and external systems are mutually depen- 
dent; for instance, technology influences 
interaction, which affects informal relation. 
ships, Conversely, the Hawthorne studies 
demonstrated how the informal system influ- 
enced “formal” production quotas, 

Lastly, the two systems and the environ- 
ment are mutually dependent. The environ- 
ment may change the formal Organizations 
(e.g., through federal and state 
union activities) 
to alter 


More recent co 
of organizations as 
Social Psychology 
Katz and Kahn 


mprehensive discussion 
open social systems is The 
of Organizations in which 
give precision to 

ization by including 
them as part of a formal model within which 


of their book, Katz and 


common characteristics of 
Open systems: 


1. Open systems, like biological or 
Must import some form of energy 
environment in order to survive, 

2. Open Systems cont. 
Process by which the 
transformed: the through- 
tion may 
the providing 


€nt of human beings, etc, 


3. "Open Systems 


ganisms, 
from the 


€Xport some product 

whether it be the 
inquiring mind Or a bridge 
an engineering firm [p. 20].” 
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for | 
x ewal | 
ment furnishes sources of energy ren 


eatec. 
the input so that the cycle may be d that 
It is important to note in this con 
the structure of a social system 


vents which 
is to be found in an interrelated set MP 
return upon themselves to complete ethan t 
cycle of activities. It is events rather, Sia 
which are Structured, so that social s! 


21]. 
dynamic rather than a static concept [p- 


ping 
ise 


terize 

5. Open systems are further ciara a 
by negative entropy. An open — than it 
more energy from its wc coU 
expends; it can thus store energy 1 
entropic forces. tran 

6. Energic inputs, which hen m i 
formed or altered by the throug Inform? 
not the only form of system er coding 
tion inputs, negative feedback, anc characte 
Process are inputs of an into in 
and serve to provide the system un rela 
tion concerning its own functioning ! j 
to its environment, ized bY is 

7. An open system is characteri is) hi 
steady state and dynamic homeosta equilib” 
does not imply a motionless or Reus to pr 
rium, but rather à force that seeks parts 2 
serve a constant ratio between the ^ in ae 
the system, Thus, when one element "oyy 


nte 


hara 


form" 
tio? 


| 


t, ar 

od 
a 
on 


1 to pres r^ 
System changes, forces are exerted 


ro] 
the character of the system by the Eent 
tional alteration of all other ization’ 
Thus, as with living organisms, organ 
evolution is a symmetrical process. i 
8. Open systems become more ‘ 
tiated and elaborated over time. jaliZ? 


n ec} 
global patterns give way to more SP 
functions, 


9. Finally, open systems are char 


g 


„ed 
zei 
acter o | 


: re 5 
by the principle of equifinality. There yen 


one best way 
final state fr 
Contrariwise, 


for a system to reach i stat 
om a particular initia : ope? 
given similar initial paries state 
Systems may reach quite different -— en 
Tn short, there are a variety of paths istent” 
any two given points in a system's ext 

The 


] 

ten 

i i f open-SY5 an 
underlying motion o 

theory i 


S that everything that gus a at 
appen is dependent on everything eh at 
“oes Or can happen; that is, all eve "E 
Correlated. Thus, the concept of EX use? 
effect is discarded. While a closed syster 
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Ul H 

pen energy in the throughput process 

Ei onto tee simpler over time as a result 

Pen Ce an open system has a 

Süstenanc my ad that allows it to draw 

(1959) E rom its environment. Newcomb 
as written: 


Th 
rather eaae to emphasize “system properties” 
em, and co; single variables which contribute to 
endurin nsequently none of the variables has an 
^ g status either as independent or as depen- 
(under e Change in one system variable is likely 
Sbecifieg ein conditions) to be followed by a 
according ta es in another system variable, „but 
Precongits © others a change in the second is à 

lon for a change in the first [p. 3881. 


All 
a pai of the system change when the 
S Tedistribir one is changed. That 1s, om 
equilib ion of energy through some fori 
theres, ibration process. An open system is, 
Ore, dynamic; it is expending energy; 
(Do as a result, it is in process and ongoing. 
übtless the most wide-ranging treatment 
ona ed concepts and applications will be 
in Buckley's 1968 compendium.) 

Vigoro open-system model has also been 
e usly advanced by the Tavistock group 
B., Rice, 1963; Trist, Higsin, Murray, & 
the s 1963) who have combined it vis 
Syst ess general concept of the sociotechnica, 
a The purpose of the sociotechnical 
both Pt is to bring into t 
the he ideal efficiency of th 
h me atticability of attaining the 10° 
Work. beings and realistic conditions © 
Mee ge productive organization, Wm is 
echn is the result of the interaction between 
ey and the system of relationships 
ea those who perform the jobs Sona 
Othe system). Each system influences e 
Sio er (cf. Hunt, 1970, for a detailed discus: 
i. of these relations). This notion, O 

tse, is not unilike Homans’. : 
the Owever, the Tavistock group, bY adding 
Ope Notion of a sociotechnical system to the 
teen. “system model, served thereby to add 
of Ignition of the importance of the channels 
interaction between the organization and 
S environment. Thus, the organization suf- 
me constraints from two sources. On the 
t € hand, the environment controls raw ma- 
rial labor, money, and so on, all at the 


Input side, while the output of the organiza- 
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tion depends on such variables as consumer 
preferences, the state of the economy, and 
government regulation. And if that is not 
enough, additional constraints are imposed on 
the organization by the expectations, values, 
and norms of its human incumbents. 

Of utmost importance in this model, how- 
is the notion that human capacities, 
preferences, and expectations are not neces- 
sarily personalistic elements, but may be 
influenced by experiences within the organiza- 
tion. Thus, the functions of personnel selec- 
tion and organizational design are of equal 
importance—human behavior in organizations 

e interaction between formal 


is the result of th 
role requirements and the nature of people 


(cf. Hunt, 1967). 
In a more recent volume from the Tavi- 


stock Institute, Miller and Rice (1967) have 
employed a boundary model as a framework 
for examining problems of organizational 
control (see also Tannenbaum, 1968). The 
focus of this model is to make a distinction 
seen the boundaries of the activity system 


betw 
(“that complex of activities which is required 


to complete the process of transforming an 
intake into an output [p. 6],”) task group 
(comprising those individuals employed in an 
activity system), and sentient group (“being 
the group to which individuals are prepared 
to commit themselves and on which they 
depend for emotional support [p. 253]”). 
Miller and Rice presented various organiza- 
tional models that represent different degrees 
of coincidence between task, sentient, and 
administrative boundaries and evaluate each 

ability to control task 


in terms of their 
performance, to ensure the commitment of 


members to the organizational objectives, and 
to regulate relations between task and sentient 


systems. 

One 0 
Miller and 
which task 


ever, 


f the major hypotheses posited by 
Rice is that in organizations in 
and sentient groups coincide on a 
more or less permanent basis, only short-term 
effectiveness can be achieved. “In the longer 
term such groups can inhibit change and 
hence can lead eventually to deterioration of 
performance and, in consequence, to social 
and psychological deprivation rather than to 
satisfaction [P- 253].” As an alternative, 
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Miller and Rice 
Systems as a 


felt that such groups (such 


iscourage the 


later (1968), 
deas as these 
gn. 

In a related vein, Crozier (1964) has ar- 
ation there is 
rather, each 


pendence, 


may numerous 
accounted for by the formal structure. 


he recognition by the Open-system model 
of the legion of variabl 


affecting the behavior of 
not, however, imply that si 
randomly distributed throug 
tion; they tend to be s 
For example, Parsons 
that organizations three distinct 
levels of responsibility and control—technical, 
Managerial, and institutional, 


In his view, an organization contains sub- 
Structures that exert forces to 


tioned above, 
relationships 


with the through- 
the Processing or conversion of the input— 
and strive to achieve technical proficiency, 

€ Managerial structure services the tech- 
nical subsystem by procuring necessary re- 
Sources for the input and by mediating be- 
tween the technical structure and the en- 
vironmental elements to which the product 
will be exported. The managerial level con- 
trols or administers the technical substructure 
(not necessarily Unilaterally or even unidi- 
rectionally, it should be noted) by deciding 
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or- 

Such matters as what the business ot EN 
ganization shall be, the scale of its ies 
and employment and purchasing poll oe 

Finally, the organization itself is em o 
wider social System, which is the P ization's 
higher level support for the goce rela- 
goals. Although an organization ma formal 
tively independent with regard to 1t onment 
controls, it must depend on its geal 
for meaning, legitimation, and pier favor- 
overall responsibility for maintaining lies at 
able environment for the organization 
the institutional level. n of the 

Parsons further argued that at n three 
two points of articulation between ; in th 
levels there is a qualitative break because 
simple continuity of line pene 
the functions at each level are quas d, not 
different. The difference is one of ral 
degree. Organizational survival “sates s 
adequate functioning of these three Jevel 
Well as adequate interaction, If any 1%. 


+. dissi 
: P agri is dis 
withholds its contribution, the system 
pate. 


Katz and Kahn 
Parsons! notion by o 
articulation of basic 
tures, Production st 
with the work that 
Structures Procure the 
output, and dea] with i 
maintenance stri 


anded 
fine! 
truc 
rne 


(1966) have pi 
ffering a somnis 
organizational su E 
ructures are conc rtiv 
gets done; SupPO he 
input, dispose “jonsi 
nstitutional re ro 
Ictures promote adequa ve 1 
performances; adaptive subsystems Sepang 
change the organization in line with a € tems 
ing environment; and managerial gubera ai 1 
direct, adjudicate, and control the oe jon 
subsystems and activities of the organiza at 
The upshot of this line of thinking 15 rde- 
since these basic substructures are M i- 
pendent, a change in any one will bear 1” in- 
cations of change for the others: n for 
Stance, differences in technology will ma an 
differences at the managerial, supportive will, 
other levels; differences in environment ro 
in turn, place different demands on the ther 
ductive, managerial, supportive, and O 
levels (cf, Hunt, 1970; Udy, 1965). sev" 
Thus a strong case has been made by “the 
eral recent theories of organization for jon 
recognition of the nature of the interact ue 
between a social system amd its envir 


PERSONALIT 


me; 

0 E th the one hand, and its people on the 
as a wed the use of the open-system model 
describin ework and conceptual language for 
ity for i organizations there is the possibil- 
plex) der thorough (but also more com- 
lor: erstanding of organizational behav- 


It is 

become Soon to predict whether this approach will 
a dominant guide to empirical work or 

ON eni will become merely the latest 
among organizational psychologists 


[Qui 
"n & Kahn, 1967, p. 4611. 


enis 
ntribution of Role Theory 


In 
a |, aS past, the mind-body problem was 
cal discussion. 


"0 
he mind-body 
he precise 


Indi 

a ev personalistic forc 
soluti at least, probably the 
ave th came about when Linton 


b 

es n explicated and advanced by such writ- 

Baas Merton (1957); Shibutani (1961), 
d Allen (1968). 


i (1965), and Sarbin an : 
rather recently, however; the concep! 
ins occupied only à secondary position 
Carlie Social science concepts, bc: 
notio sociological and anthropologica Jole 
ita) ce tended to focus solely upon [ M 
Syst, influences on personality. The adv 5 8 
Bro. €ms approaches to the study of large 
eg however, gave the concept of role a 
bi lease on life by balancing it with the 
*'sOnalistic forces that have traditionally 
en the concern of psychology: 
is Fs mewhere within the action 
s functioning organization, $ 
oon forces jointly draw @ bea 
S e, on the individual, controlling 

rmances. And we have now reached 
es of development in role concepts where, 
x their exemplary analysis of organizations, 
atz and Kahn (1966) proposed vule ‘cone 
Cepts as: 


4h 


system that 
tructural and 
d, as it 
his per- 
the 
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the major means for linking the individual and 
organizational Jevels of research and theory; it is at 
once the building block of social systems and the 
summation of the requirements with which such 
systems confront their members as individuals [p. 
197]. 

Highlighting distinctions between structure 
and person while concerned with their integra- 
tion, Hunt (1967) pointed out that a social 
system may be viewed conceptually as an 


interlocking complex of positions (the struc- 


tural aspect), or functional divisions of labor, 


which are to be populated by people who are 
expected to fulfill certain behavior require- 
ments and possess certain personal attributes 
associated with each position. Since a social 
system is a system of interdependent activi- 
ties, positions within a social structure exist 
in relationships with certain other positions. 
Thus, the “rights and duties” of a particular 
focal position exist with respect to a counter- 


position. 


This complementary contrast is, of course, a function 
basically of the complex patterns of behavior organ- 
ized around these positions and embodying the rele- 
vant mutual expectations (the rights and duties) 
vis à vis one another held by occupants of positions. 
It is possible, therefore, to regard social process as 
an interaction of positions patterned in terms of 
these complementary expectations which are them- 


selves called roles [p- 2591. 

Thus, role and its personalistic correlate, 
identity, represent the implications of social 
position incumbency and can be compre- 
hensively described only with reference to 
other roles, which bear a complementary re- 
lation to the focal role. 

We cannot explicate the ideas here, but we 
can say simply that since persons occupy 
multiple positions in life and are only partly 
involved in any single position they occupy, 
they have multiple identities that combine in 
various ways to affect their views and the 
enactments of their singular roles. And, what- 
ever else may be involved, the modes of a 
man’s participation in structured social inter- 
course will be reflected in his concept of him- 
self and in the fabric of his personality. 

The essential point, then, is not simply 
that role serves à linking function; more than 
that. In its modern forms, it exemplifies and 
operationalizes the merging of social and indi- 
vidual phenomena heretofore treated sepa- 
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ratistically. Roles are undoubtedly social phe- 
nomena. However, they are not only external 
"demands," they are dynamic interactional 
processes carried out by individuals who color 
their performances personal. By way of reci- 
procity, however, through their identity im- 
plications roles become operationally integral 
to individual personality. Thus, roles more 
than link the individual and the social (or 
structural)—they unite them. 


Role Conflict 


Cyert and MacCrimmon (1968) made 
some of these same points, in their discussion 
of organizational processes, and accorded a 
prominent place to role concepts. In particu- 
lar, they identified organizational roles as 
points of linkage and also as principal sources 
of conflict between individuals and organiza- 
tions. In the study cited earlier, Hunt (1967) 
explicated five role varieties that are helpful 
in conceptualizing role differentiation. "These 
focus attention on the interplay of patterns of 
expectation and performance within the role 
System. The importance of the distinctions 
he draws between aspects of role taking is 
that a lack of fit between any of them will 
result in some degree of role conflict, the 
effects of which have been the topic of 
relatively recent major organizational stu 
and resulting theories. 

Gross, Mason, and McEachern (1958) 
studied the perception and resolution of role 
conflict among school superintendents, They 
focused on both interrole conflicts (resulting 
from the occupancy of multiple Positions) 
and intrarole conflicts (resulting from the per- 
ception of conflicting sent-roles 5 vis à vis a 
single focal position). On the basis of their 
data, they concluded that there were three 
major cognitive styles employed by superin- 
tendents in the resolution of Perceived role 
conflict. 
he first style they call th 
tion. Individuals in this ¢ 
weight to evaluating the 
conflicting expect 


two 
dies 


=] 


e moral orienta- 
ategory give most 
legitimacy of the 


ations or the right of others 
to hold such expectations, With minimal at- 


^ Communications by persons intended to į 

a y infi "e 
the role behavior of others (cf, Kahn, Wolfe Quinn, 
Snoek, & Rosenthal, 1964), i s 
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tention to the likelihood of sanctions for e 
conformity, this style emphasizes — 
to the demand seen as most legitimate. T! n 
second type of orientation is called exp -—- a 
Priority here is given to a consideration te 
the sanctions others will bring to bear i ra 
individual does not conform to their a 
tions. The resolution style in this case Y 
minimize negative sanctions involved in. ht 
role-conflict situation. The nature of ues is 
of others to hold particular expecta e 
not nearly so important in this case as 
perceived severity of the probable sanctions | 
This superintendent, therefore, will choose e` 
conform to the expectations of those whos j 
sanctions he fears most. Moral expedient un 
the third orientation toward role Ys 
resolution. Here no primacy is given ee 
dimensions of legitimacy or sanctions; rat Y 
both are considered equally in choosing 
course of resolution. ; of 
Among the numerous empirical findings, n 
these authors was the high job satisfaction | 
reported among superintendents who lat 
ceived agreement between the sent-role fron 
the school board and their own role ics 
iypes. What is probably conceptually ed 
important was the consistent finding of M 
et al. (1958) that the effects of role pe 
on job satisfaction and expressions of "i by 
among the superintendents were mediate? di- 
the characteristic anxiety level of the So 
vidual. Implicit in this finding is the nO an 
that personality may be the result of n 
interaction of social structural conditions “in 
Personalistic dispositions, The difficulty © 
this particular study is that Gross an role 
colleagues dealt only with perceptions O the 
conflict and did not attempt to deal with ce 
accuracy of perception. So the social $ rch: 
ture is not truly represented in their ym 
Clearly the most ambitious attempt s n 
Ploy the role concept in organizati y- 
studies is that of Kahn et al. (1964). BO" set: 
mg Merton’s (1957) formulation of à s ally 
these authors built a model that is spec a jal 
designed to integrate properties of & ut 
Structure. with Characteristics of people rol? 
Populate it. They called their model 2 7, 
episode and posited that it consists M 
series of events. Specifically, role pres 


m- 
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emanate from members of the role set—the 
discrepancy between their own expectations 
and their evaluations of how well the focal 
Derson's behavior measures up. Members of 
Uhr role set attempt to reduce this discrep- 
a. cy by exerting pressures on the focal person 
to adjust his behavior to conform with their 
expectations. These pressures induce percep- 
tual and cognitive experiences in the focal 
Person that can lead to certain adjustive 
Or maladjustive responses. The role-senders 
Monitor any subsequent behavior of the focal 
m and reward him if they are satisfied 
iu his adjustment or exert further pres- 
res if they are not. This is a cyclic, ongoing 

. Process, 
in Role conflict and ambiguity may thus occur 
| WO ways. In the first case it may occur 
x Rs property of the social structure where, 
ex act, the sent-roles from role-set members 
Xert pressures for various mutually exclusive 
chaviors on the part of the focal person 
(ct, also Hunt, 1967). Role ambiguity exists 
aig the sent-role expectations lack suffi- 
Cent clarity to be translated into behavior by 
E person. The second variety of role 
ma ict occurs at the perceptual level and 
el Or may not correspond with an actual 
be of affairs, Thus the focal person may 
Exist Perceive conflict where it actually 
hot $ or he may perceive it where it does 
inq AS, OF he may perceive it where it does 

ced exist, 

bos thermore, Kahn and his associates hy- 
a €sized that the degree of structural and 
. er Celved role conflict varies as a function 
fona] CE al enduring organizational and per- 
lona] Characteristics, For example, organiza- 
| Con jo tions vary in their vulnerability o 
\structured expectations. Typically, the "a 
the o and explicit the job requirement, 
pt inel the position is to the boundary 
ber of Organization, and the larger the num- 
Clateq Ped subordinates, and equals asso- 
e tliat a given position, the greater x 
sions ‘hood that role conflict (and the 
Eka and frustrations, which may occur 
Such Ult of its experience) will vary with 
mig, ralistic dispositions as needs for 
x. "b tolerance of ambiguity, neurotic 


k Aiet 
Y, or need achievement. 


€—m 
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Thus, these authors posited that the degree 
of role conflict and its relation to individual 
experience and reactions is highly conditional 
and depends on numerous organizational and 
personal attributes. Furthermore, variables 
such as job satisfaction and general mental 
health are anchored, via the role-set concept, 
to both the social structure and individual 
dispositions. These authors have further con- 
tributed to knowledge by operationalizing 
and testing their hypotheses—successfully. 

The theories and data of the Kahn et al. 
(1964) study and the Gross et al. (1958) 
study have seriously challenged many old and 
static notions of organizational behavior. 
They have demonstrated crucial functions of 
personalistic dispositions as mediators be- 
tween formal system requirements and indi- 
vidual outcomes. In other words, neither can 
stand alone as a comprehensive explanation 
of human behavior in organizations. Further- 
more, they have provided evidence of the 
inadequacy of the personality versus organi- 
zation hypothesis. While Gross et al. have 
shown that simply holding a high-ranking 
position does not render a man free from emo- 
tional problems, Kahn et al. have provided 
data comparing six hierarchical organiza- 
tional levels and showing that tension in- 
duced by role strain increases with increasing 
rank. Since role conflict is linked with emo- 
tional reactions and mental health in general, 
it is apparently true that if the right mea- 
sures are chosen, management positions may 
not be as rosy as the personality-organization 
theorists hold. Kahn et al. have clearly shown 
that the organizational locus of emotional 
maladjustment can occur anywhere in an 
organization and is not necessarily restricted 
to unskilled or low level workers. Employee 
reactions. apparently depend on the nature 
of the organization, the nature of the job, 
and the nature of the person. It seems plain 
that Katzell's (1962) thesis warrants being 
taken seriously (and Quinn & Kahn's, 1967, 
as well). It is neither meaningful nor useful 
to promote normative, one-sided theories in- 
tended to account for all organizational situa- 
tions. There is good evidence that organiza- 
tional behavior is the outcome of a variety 
of highly conditional and highly contingent 
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relationships and situations. Future theory 


will need to build on the foundations of those 
premises, 
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PROBLEMS WITH THE USE OF «MODERATOR " 
VARIABLES 


SHELDON ZEDECK 1 


University of California, Berkeley 


Alternative concepts of “moderator” 
terms of whether they 
predictability, or involv 
of results provided by the tl 
of variable (qualitative versus qua 
in terms of statistical and methodo 
the operation and identifica 
Finally, utility of the moderator V2 


In 

Validity, e Hull noted that the upper limit of 
of ao ciente for predictors of criteria 
e order of .50. In 1966, Ghiselli re- 

Unpublished and published validity 
à pr be found that a validity coefficient 
of impro, ictor rarely surpassed .50. The lack 
Tehorts Ovement in prediction between the two 
2 eee modifications of the classic 
. The tonal validation model. 
Simple Classic validation mode 
dictor Lx of the relationship 
ning E criterion and ignores 
bles, 5 "etween behaviors on the 
tion -S a result, newer strategies for valida- 
bartie predictors have been suggested. In 

Best ar, Guetzkow and Forehan (1961) 
‘lection a prediction model to be used in 
Sis o n of executives that requires the analy- 
teria multiple predictors and multiple cri 

as modified by multiple behavior types 


ahd « 
Stray, ations. Dunnette (1963) extended the 
co, SY and suggested the analysis of the 


Or 

oye = interactions that may occur between 
oup edictors and criteria; that 15; different 
has or types of individuals, different be- 
T is tS on the job, and different situations. 
(a) Modified validation model allows for 
qijj, P'edictors to be differentially valid for 
Job p groups of individuals, (2) similar 
batte ehaviors to be predictable by different 
“rns of interactions between predictors 
to dividuals, and (c) similar job behaviors 
Quen, E differential organizational Conse- 

Ces depending on the situation. 


Tego 
an 
f rch 


1 provides a 
between pre- 
factors inter- 
two varia- 


nts should be sent to Sheldon 
of Psychology, University of 
California 94720. 
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2, Requests for repri 
leck, Department 
fornia, Berkeley, 


variables are defined and categorized in 


lead to differential validity, 


e moderated ri 


hree approac 
ntitative) examined as a m 


logical problems. The major problem 


ation of mode 
riable concept is examined. 


t2 


or lead to differential 
egression techniques. The dissimilarity 


hes is discussed in terms of the type 
oderator and 
s of 


rator variables also are discussed. 


Considerable research has been devoted to 
the identification of homogeneous sets of in- 
dividuals as a result of the modification of 
the classic validation model. These particular 
research endeavors generally have been classi- 
fied as “moderator” variable research. The 
existing moderator techniques are not en- 
tirely independent and achieve the goal of 
identifying homogeneous sets of individuals 
either through psychometric manipulations or 
through classification. procedures. However, 
analyses of the same set of data by the vari- 
ous techniques have produced differential re- 
sults. Hence, the purpose of the present study 
is to differentiate the various moderator 
variable techniques and to present the prob- 
ith the use of moderator variables. In 
the efficiency and potential utility of 
ches are discussed. 


lems w. 


addition, 
moderator variable approa 


MODERATOR VARIABLES AND PROCEDURES 


The term moderator variable is used to de- 
scribe the notion of *population control 
variable" (Gaylord & Carroll, 1948), “sub- 
grouping variable" (Frederiksen & Melville, 
1954), "referrent variable" (Toops, 1959), 
“predictability variable” (Ghiselli, 1956), 
“modifier variable" (Grooms & Endler, 
1960), and “homologizer variable" (John- 

Saunders’ (1955) original 


son ê). In addition, 
definition of the term moderator variable 


2 Johnson, C. D. The population control variable 


or moderator variable in personnel research. In Tri- 
Service Conference on Selection Research. Washing- 
ton, D- 


C.: Office of Naval Research, 1960, 125-134. 
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has been generalized by Banas? to include 
qualitative as well as quantitative variables. 
The procedures used for the above concepts 
are different and, consequently, often lead to 
dissimilar results, For purposes of illustrating 
the techniques, the procedures are categorized 
in terms of whether they lead to differential 


different validity coefficients) , lead to differen- 


n Over that of 
ion). The latter cate- 
€ moderator variable 


Differential Validity 


These techniques examine the influence of 
a third variable, qualitative or quantitative, 
on a predictor-criterion relationship. The 
third variable is Used as a basis for splitting 
the tota] group into Subgroups. That is, rxy 


for Group 4 js significantly different from 
"xv for Group A (not 


A), where A is a quali- 
ative or quantitative descriptive variable. 


variables, A 


8 Banas, P, A. An investig 
moderators, Unpublished do 
versity of Minnesota, 1964, 

* Scates, D, E, Selective 4. 
College students at the 
published doctora 
cago, 1926, 


ation of transsituational 
ctoral dissertation, Uni- 


dmission and retention of 
niversity of Chicago. Un- 


l dissertation, University of Chi- 


Simi- 

a function of location and size of n an 
lar results were obtained by W pe on 
Strabel (1935) who subgrouped x sought 
sex, residency, high school, age, degr ki ddition: 
and other population variables. In f adjust 
sex (Abelson, 1952) and degree e been 
ment (Hoyt & Norman, 1954) idities 108 
found to provide differential yali 
predictors of success in academics, subgroup 

In a frequently cited study a (1954) 
analysis, Frederiksen and Melvi vn interest 
hypothesized that the usefulness as roups 9 
test is improved by identifying su E tes 
male engineering students for Whore iet 
Was especially appropriate as a ot med 
Specifically, the usefulness of inter mpulsive 
sures is limited to a relatively pes = 
group. A measure of reading d to ide 
unspeeded vocabulary test were usec subjects: 
tify compulsive and noncompulsive ‘ca 
In 5 of 10 interest scales on the Str o betwee? 
tional Interest Blank, correlations ge were 
interest scores and grade point averag ulsiv? 
significantly higher for the near Et j- 
group than for the compulsive group. (1960) 
cation by Frederiksen and Gilbert occu" 
yielded similar results but only for ge engi 
pational keys most logically related S gen 
neering. Stricker (1966) examined t (s an 
erality of the findings with liberal dts indi 
Science students of both sexes. Resu re not 
cated that the compulsivity scales Hg coeffi- 
indicating subgroups with differen , 
cients, in 

A considerable number of uM d- 
analyses have been reported since 2 riables 
eriksen and Melville (1954) study. ouping 
that have been examined for a mns 
characteristics include demographic Bakers 
and biographical information blanks ( Starry: 
Leitch; * Medvedeff;? Rock;* Tesser, tors t° 

"Baker, B. L. The use of biographie Up 
moderate prediction and predict salary. kee 19 y: 
lished doctoral dissertation, Purdue bou a multi 

° Leitch, D. W. An initial investigation rivation 
dimensional validation model and the um dissert" 
homogeneous subsets. i a doctora v 
i iversity, 1965. TM 
T Cite "The utility of Su Er Unpublished 
ysis in the prediction of sales Hoa 1964. 
doctoral dissertation, Purdue IDEE of aca“ 

5 Rock, D. A. Improving the ss aaa Un- 
demic aievea by population db 1965. 
published dissertation, Purdue University, 


1 


l 


& 
1967, "hey, 1967), self-esteem (Korman, 
ables with ), self-report personality vari- 
Motivatio respect to motivation (Brown 35 
Wallach in risk-taking situations. (Kogan 
(French 1 , 1964), problem-solving styles 
rediger 965), knowledge and aptitude tests 
Superior’s, 1966, 1969; Steinemann, 1964), 
Hin, noi abiiy (Lawler, 1966), 
1962) m. levels (Goodstein & Heilbrun, 
acteristi notor skills (Kipnis, 1962), job char- 
zational 1. (Peterson, 1964), job and organi- 
Dation Cu E (Carroll & Tosi, 1969), occu- 
itchne oll & Gunderson, 1969; Dunnette & 
Weiss 1 1960), and job satisfaction (Dawis, 

oj LoIquist, & Betz, 1967). 

tha “ce variable. Toops (1959) proposed 
Is visis meaning of an individual's response 
"u steith to the homogeneous group, Or 
dition group, to which he belongs. In 
Tent yas the responses are relative to refer- 
Variable les (status and environmental 
Causa] es) which modify the meaning Or 
of an Potency of other variables. The weight 
van et variable is not constant, 
referrent n ls à mathematical function of the 
actions variable. The implication of inter- 
sion e suggests the use of moderated regres- 
te Iy quations, Referrent variables, however, 
tio, Mentioned in the differential validity sec- 
ustre cause of the emphasis by Toops on 
S hs and classification of profiles. The 
"S and environmental variables (educa- 


tio 

n H H 
variabla” etc.) examined as subgrouping 
bles,» les also can be termed “referrent vania- 


960 Grooms and Endler 
Medi ) formed three anxiety subgroups (high, 
ences he and low) and examined the differ- 
ionshi etween them with respect to the rela- 
rad ip between an aptitude predictor and 
terme point criterion. Grooms and Endler 
dig, d anxiety a modifier variable because 
ferential validities were found between anxi- 
dep ibgroups. A modifier variable is an in- 
r Pei variable which, when dichotomized 
oai mee leads to differential sub- 
vnb relationships. between a predictor 

liable and a criterion variable. Again, there 


GadTodifiee variable. 


9 off 
Š Brown, O., Jr. The utility of self-report variables 
Me predictive moderators. Paper presented at the 
“€eting of the Midwestern Psychological Associa- 


tion, Chicago, May 1968. 
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is no difference between anxiety as a refer- 
rent variable or anxiety as a subgrouping 


variable. 


Differential Predictability 


Differential predictability variables also 
identify groups with differing validities. In 
this case, however, the subgroups are formed 
on the basis of predictability; that is, the 
variable identifies subgroups of individuals 
who differ in predictability with respect to 
a particular predictor-criterion relationship. 
Thus, rxy for relatively predictable indi- 
viduals (small deviations from the regression 
line) is significantly greater than rxy for non- 
predictable individuals. 

Predictability variable. A predictability 
variable is a variable which is correlated with 
an absolute difference score between stan- 
dized predictor (Zp) and standardized cri- 
terion (Ze) scores (Ghiselli, 1956). The pre- 
dictability variable thus identifies those indi- 
viduals in a sample who deviate from a re- 
gression line and for whom the predictor is 


inappropriate. 

Ghiselli’s (1956) technique involves ob- 
taining absolute difference scores, |D|, be- 
tween Ze and Zp. Algebraic differences are not 
used since the model is concerned with accu- 
racy of prediction, and overprediction and 
underprediction of the same degree are consid- 
ered equal errors. The magnitude of |D| 
serves as an index of predictability; the 
smaller the |D|, the better the relation be- 
tween criterion and predictor. Correlates of 
the |D| scores are subsequently identified 
and used as “predictors of predictability.” A 
high positive correlation indicates that the 
validity of the predictor should be higher for 
those scoring low on the “predictor of pre- 
dictability” than for the entire group. The 
difference between the validity coefficients 
for the lowest (e.8 4) scorers and the highest 
(3) scorers on the predictability test is y 
d for significance. A significant differ- 
ence indicates that the predictor should be 
used only for the low scorers and a different 
redictor should be used for the remaining 
Eubjects. Johnson (see Footnote 2) refers te 
a variable with the above-mentioned proper- 
ties as an homologizer variable (a variable 


dar 


teste 
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Used to group scores of equal credibility or 
information value). 

Ghiselli (1956) demonstrated the potential 
usefulness of the technique in a study involy- 
ing prediction of Job proficiency of taxicab 
drivers, An occupational inventory identified 
those individuals for whom a tapping-and- 


dotting test was a good predictor of job pro- 
ficiency. 


In essence, 


as its own 
this technique, Scores are obtained for 
each individual in an experimental group. The 


ement of the pre- 
r Study, which im. 


plies the influence of a third variable on two 


others, 
Ghiselli (1963) found that the technique 
resulted in the identification of moderators 
highly situation-specific, Banas (see 

Ns however, found that transsitua- 
tional moderators (moderators that function 
to identify more and less predictable sub- 
groups when applied in different situations) 
obtained by Ghiselli’s technique do exist in 
the prediction of Performance of three classes 


re- 

Ghiselli and Sanders (1967) developed s 
dictability test from a self-analysis s wi 
tory that differentiated two M rc 
different patterns of heteroscedast doles that 

Blood?” found demographic brit ne rela- 
differentiated a job level-job rra (195 
tionship. In this case, the Ghise * multiple 
approach was modified in that 2 the re- 
correlation was examined to get demo- 
lationship between |D| and ied varia 
graphic variables. As a result, pm 
bles operated simultaneously as the Pls in- 
bility variable. The demographic enu 
cluded continuous and discrete v redictabil- 

Ghiselli (1960a) extended the e that it 
ity variable concept and demonstrate o tests 
is possible to differentiate which of vidual: 
gives better prediction for a given D | 
For two tests, Test 1 and Test 2, | n indi- 
|D»| are obtained, respectively. For a 
viduals for whom |Di| < |Ds|, scores “rately 
l predict criterion Scores more acc duals 
than do scores on Test 2; for all indiv re 
for whom |D,| > |De], scores on pm do 
dict criterion scores more accurately t ptain 
Scores on Test 1, Consequently, we hee 
[Ds] — [Di] for each individual and Ye 
have a set of Scores such that positive V pr 
indicate individuals for whom Test ‘ales 
vides better Prediction, and negative pro 
indicate individuals for whom Test i the 
vides better prediction. Correlates ouis 
difference between [D»| and |D;| are so ples: 
the correlates are predictability lone 
Critical scores on the predictability - b 
indicate whether Test 1 or Test 2 shou 
used for the individual. = 

Ghiselli (1960a) successfully utilize 
above-described procedure in three sod a 
using personality and interest inventor! 


d the 
udies 


s dso” 
the predictability variables. one 0 
(1965), however, failed in three ae uni 
improve prediction of college grade 


storni 
average with items from the cain 
Psychological Inventory or Minnesota ? "E 
Phasic Personality Inventory used as p 
dictability variables, Richardson sugges 


saci A 
; i n: 
1° Blood, M. R. Job level and job apres 
contingent relationship. Paper presented at ociation: 
ing of the Midwestern Psychological Assi 
Chicago, May 1967. 
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th 3 
of the f awe was due to the complexity 
he * ors involved in grade point average. 
Stable fae ig exists, however, that more 
dictability. ts would be obtained if the pre- 
of the | a variable were correlated with each 
correlate scores independently, rather than 
and with the difference between |D=] 
dictaba, It is possible to obtain the pre- 
1 is od aa when, for example, 
able aud ated with the predictability vari- 
e individ therefore, appropriate for some of 
the pre jt uals), and Dz is uncorrelated with 
appro, ictability variable (and, therefore, 
Ore. He for all individuals). Further- 
differen is possible to find correlates of the 
Predict ce between |D»| and |D;| when both 
Sibilities © are relatively inefficient. These pos" 
lea p. occur and therefore provide mis- 
ave f, information, when both Di and D» 
and 9 igh absolute values, for instance, 10 
em 7; o spectively. The difference between 
15 small and negative and would indicate 
Meee 2 is more appropriate. However, 
Oth et absolute values of each indicate that 

Pris are inappropriate. 

Resteq rant analysis. Marks (- t 
to ed that researchers direct their attention 


Situations in which indicated relationships 
iadrant relation- 


tps, In gestion, Hobert 
ss Dunnette (1967) claimed that there is 
Caes value in identifying 7 < Z) iñ 
Z> Z, licted (Zp = Ao an 

) and underprec! rather than 


the 


(1964) sug- 


9 
ud, Predictor test à 
eri rion scores. Both t 
Yield. qm are AME 
ing Class t 
als: Wig. i (high predictor-high. crite- 
tion), low hits (low predictor-low criterion), 
°Verpredicted (high predictor-low criterion), 
‘nd underpredicted (low predictor-high crite- 

tion), 

Hobert and Dunnette (1967) developed 
Wo moderators: one for the underpredicted 
he other for the high hits 


and low hits, and t 
And overpredicted. Moderators were devel- 


he predictor and 
at the median to 


ations of individ- 
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oped through an analysis of the items used 
to form the predictor composite; that is, 
those items which discriminated significantly 
between criterion score levels within a given 
predictor score level (e.g, between the low 
hits and underpredicted groups) were used. 
Hobert and Dunnette claimed that two mod- 
erators were operating, one for each level of 
predictor score, which enhanced the prediction 
of managerial effectiveness. 

Abrahams (1969), in a comment on quad- 
rant analysis, stated that mean differences on 
the predictor composite (e.g, low hits and 
underpredicted) are ignored. Since the mod- 
erator is composed of items from the predictor 
composite, differences found in item analyses 
reflect to some degree these predictor mean 
differences. The developed moderator variable 
may be predictive only of differences on the 
predictor variable itself. At best, the com- 
ponents of the moderator variable would re- 
flect a combination of differences, that is, 
actual differences between hits and misses and 
spurious differences due to predictor score 
differences. The items and variables showing 
such differences are related to the predictor, 
and it is doubtful that such a moderator has 
value beyond that already shown in the 
predictor-criterion relationship. In essence, 
as with Ghiselli's (1960b) use of items of the 
predictor of predictability test, the predictor 
items’ weights are changed. 

Similarly, McNemar (1969) demonstrated 
that when the items and scales of the origi- 
nal predictor composite are used in the mod- 
erator without the restriction that the moder- 
ator be uncorrelated with the predictor, the 
predictor itself would be a moderator. The 
demonstration revealed increased predictive 
efficiency, increased hit rate, and decreased 
percentage of overlap between high- and low- 
criterion groups. However, though the valid- 
ity coefficient of the predictor increases, the 
error of estimate remains unchanged when 
cases are screened out from the central por- 
tion of the distribution on the predictor vari- 
able. In addition, when the gain in hit rate 
is based on the total sample rather than on 
the sample after screening, the actual result 
is a loss in hit rate. i 

Applications of quadrant analysis have 
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demonstrated little success. Blood (see Foot- 
note 10) used quadrant analysis with 
level-job satisfaction d 
Were found. Berdie (1 
effects of intraindividual variability indexes 
on à predictor-criterion relationship, Again, 
there were few con 

trends. In both studi 


The 
Concept of quadrant analysis or any model 


involving off-quadrant consideration 
logical extension of Ghiselli’s (1956) tech. 


nique. Emphasis on underpredicted and over- 
predicted individuals, 


bining these individuals 


is a 


The overpredicted 
dividuals with D<-1.T 


see Footnote I), 
e group validation 


ituation, results did 
p in cross-validation. 


variability are related to pre- 
erdie (1961) used an index of 
intraindividual variability as a potential 
moderator Variable, The basic notion of the 


Concept implies that intraindividual vari- 
ability indexes determine subgroups for whom 


validation mod 


a multi-predictable group 
Bowling Green State V 


Shed doctoral dissertation, 
niversity, 1969, 


The 
more efficient prediction can be mal — 
specific hypothesis was that studen od 
variability of performance ets tak- 
mathematics subtests) was greater ber wou 
ing a test used for predictive iecit 
be less predictable than students perc 
havior was less variable. Again, the p? a pre 
is similar to Ghiselli’s (1960b) use rhe re- 
dictor as its own predictability test. ver, were 
sults of Berdie’s (1961) study, however; 
not statistically significant. : (1969b) 

A more recent study by Berdie en intra 
investigated the relationship M ph The 
individual variability and predictabi nship © 
evidence suggested that the par. 0 
Complex and consideration must be > on 
the individual’s level of men xes à 
tasks from which the variability in t the 
developed, the level of cepe e 
predictor and criterion, and the met 
in determining accuracy of m exte" 

Berdie (1969a) further studied t he vati 
sion of the Concept and found that t sont 
ability of an individual on one task 1S artain 
what related to his variability on pat 
Other tasks. Berdie, however, caution must 
the tasks on which conclusions are ma cue fo 
be specified. Ina sense, the results dre 
the existence of transsituational vatial 


use 


Moderated Regression Techniques 


yol 

yo 
Moderated regression techniques H d 
prediction equations for the total a if 
do not require explicit subgrouping gerated 
ferentiating of individuals. The mene, er 

regression equation contains a hig! n 
term such as an az cross-product A highel 
test is made to determine whether the equ 
Order term in the moderated oe that 
tion contributes to prediction beyon uatio” 
of the ordinary linear regression a: 
(without the additional higher-order hen] and 
Population. control variable. Gay contr! 
Carroll (1948) defined a popu a sub” 
variable as a variable that ee of # 
Populations in which the applica i for th 
multiple regression equation, eG multiple 
entire population, is inappropriate. at include? 
regression equation is developed = variable? 

Products and cross-products of the 


Á i 
P variable 
in a multivariate analysis. Any 
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the analys; 
Well psi may have population control as 
[x en dietar effects. 
e Bo vu (see Footnote 2) suggested that 
Toll (1948 “ables used by Gaylord and Car- 
rating in military status and degree of 
iem with the ratee, be referred to 
Johnson claim control executive variables. 
cong Claimed that the variables did not 
Serve ueniens variance but, instead, 
Should direct which additional variables 
oder in a second prediction stage. 
Soined th or variable. Saunders (1955, 1956) 
Cate bene term moderator variable to indi- 
Tedictj inuous variables that influence the 
able, Ve effectiveness of the predictor vari- 
“uation multivariate curvilinear regression 
which „D Volving cross-products is used in 
Constant 5 beta weights, instead of being 
Variables. are linear functions of moderator 
ainin "derator variable is a me: 
While z C integrity of the tota 
cag, Still maintaining a statistical 
tontin iduals membership in one of 2 
define ous, infinite series of subpopulations 
(Sauna by his score on the moderator 
the ong, 1955), Saunders (1956) treated 
dic zs product of the moderator and pre- 
Scores as a new predi 


ans of main- 
] population 
control on 


ctor in any stan- 


darg 
Rressi multiple regression technique. The re- 
x, Sion equation is y = y + Sti + Xb;Zj 


NC; ye 
we where a, b, and € are weights, *i the 
epa, and z; the moderators. À 
Unders (1956) reanalyzed the Frederik- 
and Melville (1954) data using the mod- 
*ssio regression. technique. Moderated La 
Validi n resulted in increases in DT? ictive 
i» "lity over the multiple correlation method 
diq 9f 10 comparisons. However, in no case 
i Moderated regression provide significant 
ovement over the coefficient for the non- 


o 
Mpulsive group. 


Erat 


and Katzell 


(1g; patrick, Ewen, Barrett, 
B 8) investigated “race” and “cultural 
Privation” (measured by factor scores) as 


y derators of success in various occupations. 

Do 2, the data did not strongly support 

vali hypotheses that tests are differentially 
id for different ethnic groups. 

hora (1966) d presented a generalized 
derated regression technique that allows 
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individual differences to emerge without the 
s of a priori conceptions. One set 
derived for each person on a 
d a second set for each pre- 
f weights are combined 


constraint 
of weights is 
dimension, an 
dictor. The two sets 0 


into a composite weight for each dimension 
criterion matrix. 


in the combined predictor- 
This individual differences model showed 


some improvement in prediction over the 
regular multiple regression technique. 


COMPARATIVE EFFICIENCY OF MODERATOR 
VARIABLE PROCEDURES 


Common to the alternative definitions and 
procedures is the notion of dividing a hetero- 
geneous population into homogeneous sub- 
groups on the basis of the variables affecting 
the relationship between predictor and cri- 
terion. Thus, interaction among variables is 
implied. Another implication is that homo- 
geneous subgroups, showing differential pat- 
terns of validity, may be isolated (Banas, 
see Footnote 3). Consideration of these impli- 
cations, in addition to the frequent inter- 
change of terms, has caused Banas to define a 
general moderator variable as a qualitative or 
quantitative variable that improves the use- 
fulness of a predictor by isolating subgroups 
of individuals for whom a predictor or set of 
regression weights are especially appropriate. 
Consequently, it is implied that the same 
moderator variable should be identified re- 
gardless of the procedure used. Several 
studies have examined the efficiency of the 
procedures relative to each other and to 
linear regression. 

Saunders (1956) compared moderated re- 
gression to subgroup analysis. Results showed 
that moderated regression was an improvement 
over linear regression. As previously men- 
tioned, however, the moderated regression did 
not provide an improvement over the validity 
coefficient for the noncompulsive subgroup. 
Parrish ? compared the predictive efficiency 
of moderated regression with that of linear 
multiple regression. Aptitude, interest, and 
personality scales were used as predictors, 


12 Parrish, J. A. A study of two non-linear meth- 
combining predictor tests. (PRB Technical 


ods of E 
Note 103) Washington, D. C.: Personnel Research 
Branch, AGO, Department of the Army, 1959, 1-21. 
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and combat efficiency ratings were used as a 


Criterion. Results indicated that moderated 

Tegression was not more effective than linear 

multiple regression, 
Johnson (see Footnot 


€ 2) compared mod- 
erated regression techni 


groups (selected on 
variable) than for t 

Zedeck, and Smith: 
regression, multiple 
956) procedure, and 
Variables effe 


ctive as 
moderators in subgroup analysis and pre- 
diction of Predictability were not effective in 
moderated 


& Dunnette, 
ed that Ghiselli’s (1956) 
successful than quadrant 
ng moderators, 


Procedure was more 
analysis in identifyi 

In general, mode 
been successful ini 


equa- 
1956) describes the multi- 
surface with a model that 
predictor (including mod- 
The techniques leading to 
i alidity or differential pre- 
ili er each subdivision of the 
n to have a different linear 
een predictor and criterion. 
of any nonlinearity in the 
To Zedeck, S., Cranny, ç, J., Vale, C. A., & Smith, 
PC X comparison or "joint moderators? in threc 
prediction techniques, Unpublished 
Bowling Green University, Ohio, 1969. 


relationship betw: 


manuscript, 


d the 
relationship between the moderator u- ig: 
Predictor-criterion combination ee: be- 
nored. As a result, close correspon ralitativ® 
tween results of quantitative and d iely on 
techniques would be dependent S mpositio" 
the skillfulness in choosing the yet 
of the subgroups. Tn addition, statis risons- 
methodological problems limit compa 


ES 
sCHNIQ 
PROBLEMS wit Moprrator TEC 


T lems 

Statistical and Methodological Prob podo 
et 

Several general statistical = o differt? 

logical problems, particularly in Phechnique? 

tial validity and predictability roaches 

are apparent in the moderator aPP al 


5 1 t » 
The first problem, which is pertinen cross 


moderator approaches, is that of 
validation, 1 result? 

Some investigators have reported paws 
that were 


not cross-validated (ess y 
et al., 1967; Murray"), Banas and "dem 
presented a Study that claimed to 

strate the need for cross-validation. para 
of foremen was trichotomized into su» als? 
based on job Satisfaction scores uo n 
trichotomized randomly. (Whether i 
dom assignment of foremen actually ee 
a different subgroup composition ce epin 
based on job satisfaction was not ei pê 
Three cases, each with different comb d 
of predictors, were examined to de dic jo? 
whether job satisfaction moderated m, t the 
of performance. Results indicated silo” 
Patterns of validity coefficients were 


se 
for both types of goce pem 
quently, Banas and Moore conclu ator 


cross-validation 


H 4 ode 
is necessary in m 
research, 


nt for 
This writer agrees with the argume d 
ctoss-validating research. The results 
conclusions of the Banas and Moore 


sfat” 
-— 
14 Murray, S. L. Moderators of training 54 er 


a 
(se 


ap 

4, s. in 

tion and training performance elatione dd 

Presented at the meeting of the Rocky exic? 
P 


Sychological Association, Albuquerque, New 
May 1969, sa efet A 
15 Banas, P. A., & Moore, L. Moderating fiectiv’ E 
Job satisfaction and evaluating moderator n ic^ 
ness. Paper presented at the meeting of t M 


s icago. 
Western Psychological Association, Chicago; 
1968, 


pat | 
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Foo , 

y ps study, however, are confounded 
Televant to Ge which is especially 
tial predict. iege validity and differen- 
cerns the " ility studies. This problem con- 
Validities est for demonstrating differential 
Ptedictabilitv nt qualitative subgroups Or 
used three y subgroups. Investigators have 
erential y. er for demonstrating dif- 
Coefficient A genios (a) Significantly different 
or Siem etween subgroups; that is 7xY 
"xy for pues 1 is significantly different from 
Coefficient: group 2; (5) patterns of validity 
Significa, s reveal some coefficients to be 
or Sa different from zero; that is, 7xv 
ent far n 1 and 2 is significantly differ- 
Not si m zero, but rxy for Subgroup 3 i5 
the P nai different from zero; an (c) 
erent idity coefficient of a subgroup 1S dif- 
t from the validity coefficient of the 
Ernte one that is, rxy for Subgroup 1 is 
Try f than ryy for the total group, whereas 
Tx Or Subgroup 2 is equal to or less than 
Yr for the total group. 
Apc a eed 
Way, Th is more powerfu p drm 
Coeffici he significance from zero of subgroup 
io Iclents is not fundamental to the ques- 

n of differential validity. Consider the case 
Er the coefficient for one subgroup is 40, 

for the other subgroup the coefficient 1s 
o» both significantly different from zero. 
T consider the case when 42 is significantly 
e rent from zero, but .40 is not. In either 
is ent, can we conclude that the predictor 

Useful for one group and not for another? ; 
Es is, can we conclude that the moderator 
ariable has yielded differential validities? 
s Ne Banas and Moore (see Footnote 13) 
tudy, in addition to others (e.g, Daw!s et 
Al, 1967; Kirkpatrick et al. 1968) are ex- 
amples of such statistical reasoning. On the 
other hand, if two validity coefficients are 
Significantly different from each other, then 

ç predictor can be more appropriate for 
One subgroup than the other. 

As for the third way, this author does not 
know of a statistic that tests for the differ- 
ence of a subgroup from the coefficient for 
the total group- Pervin (1967), Prediger 
(1966), and Grooms and Endler (1960), for 
example, formed conclusions based on such 
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“analyses,” but did not report the statistical 
test that was applied. Logically, this type of 
test would be most desirable to determine 
whether moderators are adding to prediction. 
Yet, if we compare the two validity coeffi- 
cients, the best we can do is to “eyeball” the 


differences. 


Perhaps different statistics should be used 


in assessing differential coefficients. McNemar 
(1969) has demonstrated that different valid- 
ity coefficients do not necessarily imply that 


criterion scores for one group can be pre- 


dicted more accurately than for the other 


subgroup. The subgroup with the highest 
validity coefficient does not necessarily have 
the smallest standard error of estimate. 
Consequently, Einhorn and Bass (1969) 
argued that the differences in errors of esti- 
mate for different subgroups, in addition to 
validity coefficients, should be examined. 
Another way to assess differential validities 
is to examine expectancy tables for the dif- 
ferent subgroups (Guion, 1965, 1966). Dif- 
ferent expectancy tables require different 
evaluations and interpretations of the data. 
A third general problem concerns the num- 
ber of different subgroups that should be 
formed. The possibility for forming an endless 
number of subgroups exists, particularly when 
examining biographical information blanks or 
demographic variables as moderators. Infinite 
subgrouping is related to the problems of 
sample size (subgroup sample sizes become 
too small on which to perform significance 
tests) and stability of the differences. Solu- 
tions are to consider several moderators 
simultaneously as the basis for subgroups or 
gle multiple-moderated-regression 


to use a sing 

equation, which maintains the total sample 
in the analyses. 

blem pertains to the validity 


coefficient of the original total sample. Im- 
proving upon à validity coefficient that is 
already relatively high is both difficult and 
somewhat impractical. While the basic pur- 
any moderator variable approach is 
ve validity in situations in which 


size 
A fourth pro 


pose of 


to impro 
predictors are poor, whether a gain is worth 


the effort involved in applying a technique 
a matter to be decided by the individual 
selection decisions or by a 


is 
making the 
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decision process that includes the notion of 
utility. 

A fifth general problem is that the effect 
of the potential moderator variable as an 
independent predictor often is not reported 
(e.g., Brown, see Footnote 9; Murray, 1969, 
see Footnote 14). Applying the potential 
moderator variable as an independent pre- 
dictor in an ordinary multiple regression 
equation can produce equally good results 
and thus would eliminate the endless sub- 
grouping and subgroup sample size problems. 
Also, there would be less claims for the 
ubiquitous moderator in the research litera- 
ture. Hence, the correlations, slopes, and stan- 
dard error of estimates Should be examined 
for the following three regression equations: 


y=a+ bx, [1] 
[2] 


where z is the potential moderator but is 
treated as an independent predictor, and 


y — a+ bix + ba, 


YS a+ bix + boz + baxz [3] 


(moderated Tegression equation). If Equa- 
tions 2 and 3 are significantly different from 
Equation 1, but not from each other, then 


the variable is an independent predictor and 
not a moderator variable. 


Operation of Moderator Variables 


The literature concerning moderator vari- 
ables has focused primarily on demonstrating 
the existence of moderators rather than 
explaining how they operate. A common con- 
ception is that moderators operate by sorting 
heterogeneous aggregates of individuals into 
homogeneous subgroups (Berdie, 1961; John- 
Son, see Footnote 2; Saunders, 1956). The 
Subgroups are intended to be homogeneous 
with respect to error and psychological struc- 
ture, with the magnitudes and patterns of 
reliability and validity coefficients varying 
from group to group (Baker, 1967). Banas 
(see Footnote 3) stated, however, that sub- 
grouping is not simply homogeneous grouping 
but is grouping restricted usually to the 
extreme scores on the moderator variable. 
According to Ghiselli (1963), evidence 
indicates that moderators differentiate those 


f 
individuals in a group for whom ped. 
measurement or prediction is small er 
group for whom it is large. This anmi E 
presumes that individuals can be divide um 
clear and distinct classes. Ghiselli e 
that, in actual practice, moderators eA uals 
individuals along a continuum. eee 
are sorted into separate classes, and à § ame 
consists of individuals who fall at the S 
point on the continuum. ratot 

Another possible explanation of mode ne 
effects is that the common elements, , vari 
account for the correlation between two qual 
ables, differ from individual to yt i, 
rather than from group to group (G s oF 
1963). Rather than sorting into glas the 
groups, each falling at a single point O ater 
continuum, there is sorting into class ard 
vals. Such a notion presumes that erro in- 
measurement varies from small for mi 
dividuals to large for others. en - 
error scores would carry less weight in duals 
mining observable scores for some indivi ndi- 
than for others, However, a necessary eo rro 
tion is that individual differences in sd 
SCOres possess reliability over parallel tes a 

Likewise, error of prediction is smaller E 
test validity higher for individuals at 
extreme on the moderator continuum, an». 
the other extreme, error of prediction is po 
and test validity lower (Ghiselli, 1963). dic- 
sequently, the weight a test carries in p l 
tion varies from individual to indivi’, 
With respect to validity, the function of ual 
moderator is to predict for a given indiv! cri- 
the weight a test carries in determining the 
terion performance, However, nothing in He 
concept indicates the relationship of the 6j or 
vidual’s weights to the criterion an 
test scores. sta US 

The above conceptions are appropriate ten- 
pending on the given situation. Little ber 
tion, however, has been given to the ma në 
matical operation of moderators. In 9 


. of 

75 Interestingly, though the notion of ro ot 
measurement is used to explain the opeta ten i 
moderator variables, error of measurement ]y is 
respect to the moderator itself is ignored. Ra it 
the reliability of the moderator reported, an re- 
Would not be surprising if large error of MET 
ment was responsible for moderator results 
holding up in cross-validation. 
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Stud 

insiceesctudee (1969) developed and then 
Mathematical tested the implications of the 
Ictability bases of Ghiselli’s (1956) pre- 
Ps variable. The study (Conger; 
able Sh that the predictability vari- 
Some trait s deviations from the norm on 
trait itself rather than deviations from the 
trait), (e.g., presence or absence of the 

T , 
su Rud determinant of the operation 
ciency of ors, as in the assessment of ef- 
of variab] moderators, depends on the type 
h ORC Terr or continuous, examined. 
Involye T of moderator variables 
and disco e distinction between continuous 
argument ntinuous qualitative variables. The 
in made by Ghiselli in his 1968 study 
Moderato. unpublished study " 1S that the 
eterogene does not function to separate à 
€neous neous population into a more homo- 
discontinuous A true moderator is mot à 
reniiate qualitative variable that dif- 
Walitative subgroups of individuals who are 
ned different, but is à continuous 
te al] s variable, and individuals distrib- 
di s ong its continuum. For qualitatively 
Can oe groups the pattern of correlations 
theo, e different, and classic psychometric 
nde, still is appropriate, for there is no 
contin ot effect. If groups differ along a 
theor uum, however, then a new psychometric 
ect js is needed to elicit the moderator ef- 
Or 65 w essence, there is à continuous (linear 
the rvilinear) relationship between scores on 
Roce tee and some predictor-criterion 
simi] combination, This conceptualization is 
de. to Saunders’ (1956) position that a 
anal rator variable is a generalized case of 
S ysis of covariance, with the basic param- 
ae score on a continuous variable rather 
group membership. 
pA Situations in which discontinuous quali- 
e ve variables have been used as moderators 
me subgroup analysis), à heterogeneous 
heen is divided into more homogeneous 
ual SOUS: A complex aggregate of individ- 
S is simplified, and there is no interaction 


17 Ghiselli 3 

T ` Ghiselli, E. E. The prediction of human be- 
Re Paper presented at the meeting of the Ca- 
adian Psychological Association, Calgary. June 


1968, 


Pp — 
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between the moderator and predictor-criterion 
score combination. 

Marks (1964) also has related off-quadrant 
techniques to analysis of variance. A signifi- 
cant interaction between predictor and cri- 
terion indicates the possibility that a moder- 
ator is operating and that the off-quadrants 
should be examined. 

Off-quadrant techniques are explained in 
another way. Lykken and Rose (1963) sug- 
gested that the predictability of the criterion 
(Y) from the predictor (X) varies as a func- 
tion of a moderator (Z), which may be un- 
correlated with either Y or X. The function 
relating Y to X differs among individuals in 
the sample. One equation, for instance, 
Y — aX, is appropriate for the group close 
to the perfect regression line, whereas an- 
other equation, for instance, y-b/X, is 
appropriate for those further from the regres- 
sion line (the off-quadrant cases). The pre- 
dictability of Y from X varies as a function 
of Z.Z isa function of the likelihood that 
an individual belongs to the Y — aX group; 
that is, Z is à discriminant function. 


Hobert and Dunnette (1967) applied the 
(1963) explanation to the 


Lykken and Rose 
quadrant analysis approach. Hobert and 
Dunnette (1967) claimed that equation 


Y = aX is appropriate for the low hits and 
high hits, whereas the underpredicted and 
overpredicted are described by Y = b/X. In 
this case, Z is correlated with Y. Also, two 
equations are necessary to describe the off- 


quadrant cases. 
Zedeck (see Footnote 11) applied discrimi- 


nant analysis in the multipredictable group 
validation model with little success. The dis- 
criminant variables (moderators) identified in 
the experimental groups were not effective in 


the cross-validation group. 
Consequently, there are several alternative 


suggestions as to how moderators operate. 
Yet, moderators are as elusive as suppressor 
a fact, we may consider a sup- 


able as one type of moderator, 
reverse is not necessarily true. A 
variable is correlated with the 
predictor and uncorrelated with the criterion, 
whereas à moderator is uncorrelated with 
both the predictor and criterion. 


variables. Tr 
pressor vari 
though the 
suppressor 


] 
HY 
i 


| 
| 
i 
f 
} 
i 
if 
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Identification of Moderator Variables 


À review of the moderator variable litera- 
ture indicates that a considerable number of 
scales and variables function as moderators. 
The problem of readily and systematically 
identifying a moderator variable, however, 
still remains unsolved. Banas (see Footnote 
3) distinguished between the rational and 


empirical approach to the identification of 
moderators, 


The rational approach to the identification 


of moderators involves intuition, hypothesis- 
forming, and/or accident, Saunders (1955) 
realized the problem of identifying moder- 
ators and suggested four procedures: 
Start with a good hypothesis; 
few cases intensively and k 
of which interpretation se 
other variables; (c) 


Within which correlations are significantly 
different; and (d) test the interaction vari- 
ance of a pair of items against a criterion. 
"oops (1959) suggested that moderators are 
found by looking for “ig - e provided” 
statements, Frederiksen and Melville (1954) 
examined compulsiveness as a moderator, for 
it “appeared to be useful” as the basis for 
a hypothesis, 

Conger (1969) suggested four a 
for looking for moderators: (a) 
differences between current 
standardization 


ems to depend on 
look 


pproaches 
Look for 

subjects and 
look for differ- 
tics and formats of 
or subject character- 


These methods of trying 


ses, however, are time consuming, inefficient, 

and costly because Of the considerable 

number of possibilities that exist. 
The techniques by Ghiselli (1956, 1960b), 

Berdie (1961), an 

(1967), 


out specific hypothe- 


dered empirical 
Banas (see Footnote 3) men. 


predicted criterion, (c) as- 
dual variability, or (d) as- 


infrequency of 
response inconsistency, 


sessing intraindivi 
sessing statistical response 
and/or 


Though the 


la: 
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variable 
techniques approach objectivity, one ee 
at a time is investigated, and, therefore, 
procedure is not efficient, that 
Consequently, statistics or pannus mod 
systematically or mechanically identi y since 
erators are needed. The task is Mr 
not enough evidence has enim pr 
make it possible to state any genera ira 
ciples about the nature of items or 
that function as moderators. sts t0 
There have been few empirical oot 18 
solve the identification problem. ae 
attempted to develop a program for the large 
tion of moderator variables from a E 
number of variables. The program a oten- 
signed to identify moderators from 4 Sp the 
tial moderator variables without mts 
time to compute actual correlations. es ane 
based on the joint distribution, in 3 X iterion 
tingency tables, of predictor and cr used 
Scores that had been trichotomized, were tech- 
in place of correlation coefficients. The rtion 
nique was unsuccessful because of disto: 
of the indexes. loped 
Rock, Barone, and Linn (1967) deve tep- 
a computer program for a moderated ators 
wise prediction system. Potential made o 
are examined simultaneously orina sep 
manner. Klein, Rock, and Evans (1968) | y: 
limited success with the program in one SUC” 
They favored the stepwise system ie an 
torial designs because the latter requit hic 
impractically large number of groups, W an 
are difficult to obtain in educational í i: 
industrial settings, Tn addition, factoria 
signs lose information by not treating 
moderator as a continuous variable. use 
Zedeck (see Footnote 10) attempted to ari 
discriminant analysis and analysis of cové 
ance to identify moderators from amonit 
potential variables in the multipredict? 
group validation model. The assumption ES. 
that the significant high weights of sar ae 
Criminant function were operating as -—. 
tors. Cross-validation of the moderators 1 ful. 
tified by the approach proved unsuccess 6 
The need for systematic cide en 
Moderators is still evident. Abrahams (19 
'5 French, J. W. 


in- 
its 


the 


yat ot 
A machine search for nies 
Variables in massive data. (Office of Naval Been 
Report, Contract Nonr 2A OO NR. 15 3961. 
Princeton, N. J.: Educational Testing Service, 1 
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Ma s 
E ny Statement, claimed that investi- 
Prediction € looking elsewhere to enhance 
Only as i moderators were being found 
the autho, med as chance permitted. Though 
e a lr no evidence for reduction in 
E echar moderator studies, the utility 
tematic ta LM will diminish unless a sys- 
moderators nique is developed to identify 
Possible 2 from a large mass of data. It is 
d tims = hey (1963) cautioned, that 
ind mode nd effort required to develop and 
Spent im Tators might be more productively 
the sort ig reliability and validity of 

at follows from classic theory. 


Urr 

Pine cux 

TY or MODERATOR VARIABLES 
moder- 
of the 


Iness of 
hich 


Cor i " 
hor ie ratus of the problems with 
Utility ree raises the question 
à Conce: e approaches. The useful 
validity S evident in situations in W 
t with coefficient is low, but, where at 
“Curate 1 certain individuals, reasonably 
May be Prediction of criterion performance 
n one from scores on the test. Selec- 
ti n ie oa ts can make use of different predic- 
lig, ĉQUations and different patterns of va- 


les $ M 
for different links in the prediction 


Nod 

el 

(195,5 Suggested by Guetzkow and Forehand 
the 


hup) and Dunnette (1963). Though 
eif, x of studies concerned with moderating 
blems limit 


Ci 

ir anh the results and pro 
d ity. 

Sup teh predictions are improved for one 
Stoup D there remains usually another sub- 
'Sheq ne whom predictive efficiency is dimin- 
or ," The implications are to eliminate them 


o n 
find new predictors for them. Since 
x, and religion 


ei 


Vari 
of anbles such as race, color, se 
y ES found to be moderators, the former 
eat, ation is illegal and unethical. ‘Therefore, 
fone attention must be given to the search 
valid predictors for all groups in the 
wlation, 
tion citz’s (1966) “sequential moderator” 
Atop might enhance the utility of the moder- 
E and alleviate the problem of 
JN Ing those individuals for whom the pre- 
ita r appears to be inappropriate. The typ- 
validation approach calls for validating 
test and examining the influence of moder- 


no- 
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ators at a particular point in time. It is recog- 
nized that the pattern of individual differ- 
ences associated with performance changes 
with practice over time (Fleishman & 
Hempel, 1954). Thus, the time at which cri- 
terion (e.g., performance) measures are taken 
should be varied and, consequently, the pre- 
dictor (e.g., measures of motivation or per- 
sonality) validated at the appropriate time 
as dictated by a moderator (e.g., aptitude). 
We might establish from sequential moder- 
ator research when predictors are most valid 
for different subgroups. 
Dodd, Wollowick, and McNamara (1969) 
tested the sequential moderator concept. Re- 
sults indicated that whereas a personality 
measure was related to salary in early years 
for a group with high training grades, per- 
sonality was unrelated for a group with low 
training. grades for the same period. How- 
later years, personality predicted 
th high and low training- 
grade subgroups. Dodd et al. concluded that 
training grade or task difficulty is a sequen- 
tial moderator. We might consider also that 
years and training grades were “joint moder- 
ators" (as discussed below). However, the 
notion suggests that we do not have to elimi- 
nate subgroups for whom it is initially dif- 
ficult to predict. In the long run, the success 
dicted individuals may 


of predicted and unpre 
be equally predictable. Furthermore, having 
f success can prevent 


al from leaving or 


ever, in 
equally well for bo 


long-range estimates o 
individuals. with potenti 


being eliminated. 
Another way to increase the utility of the 


moderator concept is to consider the opera- 
tion of joint moderators. Joint moderators 
are defined as two Or more variables that 
interact to influence the relationship between 
predictor and criterion (Zedeck et al., see 
Footnote 12). Fredericksen and Melville 
(1954) indicated that considerable improve- 
ment in prediction was obtained for a sub- 
group considered noncompulsive based on two 
different indexes. Stricker (1966) formed four 
subgroups from the combination of the two 
dichotomized compulsivity measures. In this 
case, the joint use of the two vatiables did 
not enhance the moderator effect. This result 
may be attributable to the fact that both 
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moderators 
construct, 
Klein et al. (1968) 
Suggested by Rock et al. 
the effectiveness of joint 
prediction of law school 
(age, major field of study, 
tion by college for 


were measures of the same 
used the procedure 
(1967) to examine 
moderators in the 


used to 
grouping pro- 


three Broups. 

Zedeck et a]. (see Footnote 12 
the operation of two joint moderators, study 
habits and anxiety, in the moderator models 
of subgroup analysis, prediction of predict. 


) examined 


tributed to the ineffectiveness of the joint use, 


Finally, the problem of subgroup sample size 


1S more evident in joint moderators than 


In general, the results 
Tesearch further 


dis- 
ntially considers the 
of moderators (dis- 
However, as already noted, 
the results were unclear, Perhaps using factor 


analysis to diminish the number of potential 
variables and, then, factor scores to form 
predicted, Overpredicted, and underpredicted 
groups would enhance prediction, It is pos- 
Sible that the factor structure Varies con- 
Siderably from group to group. 

"T'esser et al, (1967) computed factor scores 
derived from life history items for their sub- 
jects, but then placeq them into homogeneous 
groups based on an index of profile similarity 


+ ability: 
rather than based on degree of predicts a 
The results were inconclusive because i ack 
Significance in cross-validating; how: DAG 
of significance in cross-validation Mid sizes 
been because of extremely small samp 


CONCLUSION 


` re- 

The utility of the moderator variana 
search is limited by statistical prob  tistica 

the limited understanding of the be a 
Operation of moderators, and by vi ntifict" 
Tapid systematic approach to the i the St 
tion of moderators. With respect to en the 
tistical problems, comparisons ges intet 
standard error of estimate, slope, ae pif- 
cept for the subgroups should be ma correla” 
ferent statistics are needed since 4 attern? 
tion coefficient can indicate different ps non 
of relationships, which are a result 0 tial 
normal distribution, skewness, or p odologict 
in range. Perhaps, to alleviate metho model 
Problems resulting from subgoupme i group 
ated regression equations for the total £ d 
Can be extended to include squat owing 
other higher-order terms, thus 27 iiy 


:nearitY: 
detectability of other forms of nonlin taiem 


Finally, there is the semantics Proa 
The term moderator is used ambig" 
in the literature, As demonstrated, wt 
may refer to control or population. ue i 
or interactions, and/or differences in P used: 
ability, depending on the procedi eig jn 
Though the terms do not have entir the 
dependent meanings, we should keep 


ings 
i: aning 

mathematical and methodological me 
distinct. 


ples: 
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DESENSITIZATION: 


SOCIAL / 
EFFECTIVENESS OF 


AND COGNITIVE FACTORS UNDERLYING THE 


WOLPE'S PROCEDURE 


WALLACE WILKINS? 


University of Washington 


Recent evidence suggests that Wolpe’s theoretical formulations are inadequate 
of the therapeutic approach of systematic 


in accounting for the effectiveness 


desensitization. Of the element 


s traditionally 
muscle relaxation nor t| 


included in the desensitization 
he construction of a 


procedure, neither training in 
ant scenes nor the concomitance of instructed 


graded hierarchy of fear-relev 
imagination to muscle relaxation 
instructed imagination by itself 
zation procedure. Further, the e 
to be due to the traditionally 
relaxation. and anxiety, but rathe 
patient-therapist relationship an 
therapeutic gain, infor f 
attention, and vicarious learni 
instructed imagination. 


is the 


The 
pe Use of systematic desensitization as 
gro Y Chotherapeutic treatment method has 


W oye è x. 
^ rapidily since its introduction by Wolpe 


(los 
Ua... and currently enjoys widespread pop- 
alth disciplines. 


r 
L ET among many mental he n 
Ness E (1963) has documented its effective- 
nq n treating a wide variety of disorders, 
the Rachman (1967) thoroughly reviewed 
Wai eh on systematic desensitization 
n, ue at that time, described the poccat 
fecti € appealing theoretical rationale for its 
tio Veness, A number of recent investiga- 


el "S have been reported that, while not dis- 
alr f the procedure, 


do Ming the effectiveness O u 

u Rol into serious question the theoretical 

Vey. Dinnings from which (1958) de- 
Ped this technique. 

Em e most popular form 

Dloys muscle relaxation 


Wolpe 


of desensitization 
as an inhibitor of 


& Wish to thank Hans O. Doerr for his significant 
qu ttibutions in the development of this article, Re- 
is ests for reprints should be sent to me SLB SY who 
aad at Department of Psychology, University of 
tami, Coral Gables: Florida 33124. 


d cognitive 
mation feedback of success, 
ng of the contingencies of be 


are necessary conditions for treatment success; 


only necessary element of the desensiti- 


ílectiveness of the procedure does not appear 
stated mutual antagonism between muscle 
r due to social variables involved in the 


variables involving expectancy of 
training in the control of 
havior through 


fear or anxiety, and, according to Wolpe, the 
stages of the procedure are (a) training in 
deep muscle relaxation, (b) arranging fear- 
provoking scenes into a graded hierarchy, and 
(c) the instructed imagination of the hier- 
archy scenes concomitant to a state of muscle 
relaxation. Recent studies, taken together, 
suggest that of the three above conditions, 
none are necessary for successful desensitiza- 
tion. Indeed, the only necessary condition 
appears to be imagination of fear-relevant 
scenes not necessarily arranged into a hier- 
archy and not necessarily concomitant to 
muscle relaxation. Thus, Wilson and Smith 
(1968) reported success in severelv neurotic 
subjects where treatment consisted of relaxa- 
tion contiguous to the imagination of free- 
association scenes that were not organized 
into a hierarchy. Certainly, individually 
tailored hierarchies are unnecessary, as Emery 
and Krumboltz (1967) found no difference in 
anxiety reduction whether they employed in- 
dividualized hierarchies ora standard hic 
archy developed by the investigators prior to 
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interviewing subjects. Leitenberg, Agras, Bar- 
low, and Oliveau (1969) also demonstrated 
fear reduction using a standard nonindividual- 
ized hierarchy for all subjects. Rachman 
(1968) presented arguments that training in 
muscle relaxation appears unnecessary, as suc. 
Cessful fear reduction has been demonstrated 
With experimenters who are fairly inexperi- 
enced in the technique of muscle relaxation 
and with subjects who receive only perfunc- 
tory relaxation training. Also, Garfield, Dar- 
win, Singer, and McBrearty (1967), Cooke 
(1966), and Ritter (1968) reported successes 
with in vivo desensitization where subjects 


sions. Lazarus (1965) 


reported successful 
cases where subjects ar 


€ instructed to in- 
ncomitant to imagin- 
Scenes. Davison (1966) 


uscle relaxation and anx 
lety are not mutually 


antagonistic, since anx- 
ed while a person is in 
produced muscle relaxa- 
and Raines (1966) dem- 


Posure to less fear 
(personal communication, 1966) also rep 
studies in which imagin 
of fear-provoking Scenes was used without 
training and without arranging 

them into a graded hierarchy. 

These studies strongly suggest that Wolpe’s 
theoretical formulations are inadequate in 
identifying the critical variables involved in 
treatment Outcome. Of the elements included 


in Wolpe's procedure, instructed visualization 
of hierarchy 


orted 
ation 


communication, 1966) in- 

instructed. visualization 
may also be a sy icient condition for thera- 
peutic gain. The remainder of this article 
focuses on fac 


tors Operating in the desensitiza- 
tion situation which research evidence indi- 


: cting the | 
cates are highly influential in affe 
outcome of treatment, 


j: and the 
Expectancy of Therapeutic Ge? nfear Be 
Therapist as a Reinforcer of Not 
havior 1968) is 
A study by Wagner and cute S as 4 
suggestive of the effect the wem ah did "0t 
social reinforcing agent. “Treatment hs 
focus on fear; instead, subjects helk ap. 
liberal amount of compliments pe verbal 
pearance and attitudes. When giv ake, hey 
reinforcement for approaching a $n jor. 
showed reduction in avoidance pex cant 
Leitenberg et al. (1969) found gr con 
differences in fear reduction perm 74 
trol subjects and subjects who rec Ü niet" 
laxation contiguous to Mic mee exp A 
archy scenes, instructions forming a! for sme 
tion of therapeutic gain, and presa Me 
improvements, However, no differe! and su 0 
found between no-treatment controls Ay E 
jects who received relaxation aH. ioni 
imagination of hierarchy scenes, ins ner? 
designed to eliminate expectation veri nt? 
peutic gain, and no praise for impro ombin 
The authors argued that since the E: d 
tion of relaxation plus cacao ae 
archy scenes did not produce sign! he ma. M 
ferences from the control group, d n M. 
therapeutic benefit from desensitiza acC? 
due to the instructed expectation we 
or verbal reinforcement for small a pen 
These two effects, expectation di in 
peutic gain and reinforcement for ai gu 
provements, were separated by eas in a 
Leitenberg, Moore, and Wright (1 zat ra 
2 X 2 factorial design such that one ] insti 
ceived both, a second group cp. uti 
tions forming an expectation ar p soa 
gain, a third group received po rec M 
improvements, and a fourth EA relaxat 
neither. All groups also receive iy sceny 
contiguous to imagination of Er EA "je 
Although all groups showed ppt only "ic 
analysis of variance indicated ry theraPe y 
factor involving expectation Tube for E. 
gain had a significant effect; P factors W° 
Provement and the interaction jill 
not significant, a that the § ji 
From these studies it appears eetan 
of the therapist to develop an 


P 


therapeuti ^ 
| Oring a Ic gain and to act as a social rein- 

a large Sent for improvement contributes in 
Vatiables Part to therapeutic gain. These 
Dear O be. the therapeutic relationship ap- 
ave an po powerful in that they can 
| Wing ver ediate and long-term effect fol- 
y minimal contact with the thera- 


bist (g 

Migs Kahn & Baker, 1968; Lang, 1969; 
| Rardin, "en Migler & Wolpe, 1967; and 
Shsitizati i for procedures involving de- 
Th on with minimal therapist contact). 

e Ro 

Pro s of Information Feedback 

, “rovidi . 
Vis EE à subject with information about 
h rmance has therapeutic effects, as 


sl 
yo by Lei 
Wright eitenberg, Agras, Thompson, and 
| Skeq (1968). Two phobics were merely 
ety- 


Dto hs remain in the presence of anxi 
Aut, $ Stimuli until they felt anxious. The 
Kiven p found that when the subjects were 
ha Pim. information as to how long they 
th m de in the presence of the stimuli, 
Derg, ma 1 of exposure time increased and 
areq Spes variability decreased as com- 

n. p. trials when this information was not 
tect, raise added to information had little 


Sn 
by Aie fearing subjects in studies conducted 
Sou ir ns and Ray (1967) heard heartbeat- 
Present noises increase in tempo during 
bss ee of shock stimuli and remain at 
Stimy level during presentations of snake 
the e 1. In reality the tempo was controlled by 
tol “Perimenter; however, subjects who were 
heart at the noises were from their own 
for ui who were, in effect, receiving 1N- 
Bresse that they were not aroused during 
te Rees of snake stimuli showed greater 
i a in avoidance behavior than did 
Nis eit who were told that the sounds were 
, In he oo noises. 
veral desensitization procedu 
bagp ^. Possible sources of information 
te, ce may contribute to change in fe 
DË his ion. Of these, the subject’s perception 
arq S Own behavior, progression up the hier- 
Cussed and physiological reactions are dis- 


ire, there are 
feed- 
fear 


| È * Subjects perception of his own be- 
"s or as information feedback. Bem (1967) 
DN marshaled evidence that self-reports of at- 

des and feelings are; in part, a function of 
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the self-observation of overt behavior. A 
self-report of an internal event is viewed as 
an inference from observing one’s own overt 
behavior in the presence of accompanying 
situational variables—subjects are using the 
same information (overt behavior) that an 
outside observer would use in making in- 
ferences about a subject’s attitudes or emo- 
tions. For illustration, Bandler, Madaras, and 
Bem (1968) applied shock under conditions 
where subjects were either instructed to escape 
or endure the shock. Even though shock in- 
tensity level was the same, subjects rated es- 
caped shock as significantly more painful than 
endured shock. The authors offered the inter- 
tion that the reports of pain were con- 
trolled by subjects’ perception of their own 
overt behavior—if they observed themselves 
escaping, pain was reported to be greater 
than if they observed themselves enduring. 
This interpretation is applicable to the de- 
sensitization situation: If a subject endures 
the imagination of a scene, he should con- 
clude that he is more comfortable and should 
report less anxiety than if he observes himself 
signaling to terminate the scene imagination. 
A variety of factors may be influencing sig- 
naling behavior (e.., reinforcement for not 
signaling, patient's susceptibility to therapists 
on, demand characteristics of the situ- 
The point is that the behavior of 
terminate scene imagination is 
bsent. Its presence or absence 
ormation that can influence a 
s of fear and changes in 
The information function 
operates even when the 
is reversed as in the suc- 
presented by Cautella 
(1966) where subjects are instructed not to 
signal the presence of anxiety, but only when 
the anxiety is no longer experienced to the 
imagined scene. 
2. Progression up the hierarchy as informa- 
tion feedback. An additional source of in- 
formation, when it is included in treatment 
is the perception of progression up the hier 
archy. Lang and Lazovik (1963), Lang, Lazo- 
vik, and Reynolds (1965), Davison (1968) 
and Cooke (1968) have found that the npe 
ierarchy scenes passed is highly re- 


ber of hi : 
lated to fear reduction. Subjects who pass all 


preta 


persuasi 
ation). 
signaling to 
present or à 
serves as inf 
subject's self-report 
avoidance behavior. 
of overt behavior 
meaning of signaling 
cessful procedure 
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or nearly all of the hierarchy scenes show 
greater fear reduction than subjects who do 
not come as near to completing the hierarchy 
in the same number of sessions. The latter 
group is typically no different after treatment 
than subjects who receive no treatment, 

As presented earlier, the use of a graded 
hierarchy is not necessary in producing suc- 
however, when in- 
rocedure, a subject’s 
archy does provide 

improvement, Self- 
Perceived progression through the hierarchy 

i other self-perceived 
ence reports of fear 
and changes in avoidance behavior. 


cal reactions ag in- 
man (1968), Zeisset 
al. (1969), Lomont 
and Doerr (personal 
reported studies in 
vity of subjects re- 
en recorded, No sig- 


ces on the Physiological level 
have been found between treatment and con- 


trol subjects or between pre- and postmeasures 
in subjects receiving successful treatment, 
even though significant differences are ob- 
served in behavioral and self-report measures 
of fear. Following treatment, the desensi- 
tized group sometimes Shows less reduction 

eated subjects in physiologi 


ceiving treatment has be 


An avenue for accounting 
that through desensitization, 
tions and emotional behavior 
cur without concomitant changes on the physi- 


for the findings— 
changes in emo- 


1965; 
Schachter, 
& Singer, 
Valins, 19 
work demonstr: 
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the 
; : in terms of th 
label or describe this state in term 


o investi- 
cognitions available to him. These physi 
gators present evidence that the same Pa 
logical state may be experienced, ae love; 
described as anger, joy, fear, pue or 
or a variety of other emotions, Spe the sit- 
cognitive-perceptual factors irai ua i 
uation in which the arousal is exper minani 

If cognitive factors are major deve ati 
of the affect experienced in a given ‘the emo 
it becomes a distinct possibility eat avoid- 
tion (and subsequently, approach P viding # 
ance behavior) can be altered by prd he 
subject with experiences through w state 
cognitive label applied to the teega t- 
altered. A subject could show the > presen? 
tern of physiological activity in Lad throug 
of a previously feared stimulus, bu could bs 
cognitions acquired in treatment he wis anc 
labeling his arousal in a different emo jon 
therefore be experiencing a different ject ater 
New cognitions available to a su o mag E 
for example, that he has been able e signal 
scenes involving the stimulus without gen 
ing the presence of fear or anxiety ulus 
then is an inappropriate label for $ ers nal 
produced autonomic activity, From Posting 
observations of approach behavio! "s 
emotional expressions of fear prior c to & 
ment are changed following eer a 4 
pressions of excitement and glee at be gare 


to actually handle the previously 
stimulus, 


The Role of Instructed Imagination 


: ing ? 
Evidence was presented at the bea o 
this study that instructed ve PAR eme 
feared stimuli is the only necessary zo bi 
of Wolpe's procedure. There QUU M igi 
two aspects of this process € earel 
subjects learn ways to respond to Mn it 
stimuli other than escaping or ayoa! ‘penel 
1. Controlled attention shifts. The muscle 
of procedures involving changes vere may 
tension concomitant to scene eer anxiety: 
not be due to reciprocal inhibition P technique 
but due to training subjects in A directed 
that allows a termination of attention the con 
toward the threatening situation. In ists Së 
ventional desensitization, the d 
quence of instruction during we napintas 
Scenes is relax, imagine, relax, stop § 


i 


E 
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elay, T 1 
lion m is an instructed shift of atten- 
i MM to threatening scene, back 
Voluntary t allowing repeated practice in 
toward th termination of attention directed 
Sons or E fear-producing stimuli. For rea- 
Scene oe design, termination of 
T Su Du on was not under the control 
à IS group Ped of Davison’s (1968) groups. 
7 of s howed little fear reduction, and 
pute t5 een that self-control of ex- 
‘table in eared stimuli may be a critical 
this, to desensitization. Parallel 
tet? p and Viek (1948) demon- 
ies ination seins could control the 
E" t signs of a noxious stimulus developed 
ving (his of anxiety than did subjects re- 
ination s 1 exposure but for whom 
* control of the stimulus was not under 
tse rela ar reduction techniques that do not 
th fate "an do contain the characteristic of 
mie io enun in shifting attention from 
lis, fp eal stimuli. Lazarus’ (1965) 
stant BÍ directed muscle activity con- 
tenes ies imagination of fear-provoking 
at p a disruption of attention di- 
E en the stimuli. The emotive im- 
bo movit, Ti introduced by Lazarus and 
4, tice z (1962) involves the instructed 
Ww Cateni of controlling attention between 
Olpe Ng and pleasant imaginal stimuli. 
Tiene (1960) reported a decrease in fre- 
SOntie of an obsessive thought by applying, 
; Ec to the imagination of the scene, 
he im Shock which caused a disruption 1n 
i po ned Scene. 
be ine re to contingencies of 
Soldi in the fear-provoking situation. In 
his es g an aversive situation, a subject also 
Wenge exposure to information about conse 
nn n of nonavoidant behavior in that situa- 
Carns Experiences through which a subject 
the à nonavoidant techniques appropriate to 
ley; nxiety-provoking situation should help 
Situa ate fear and avoidance behavior in that 
tion. 
to Eod instruction of behavior appropriate 
b, 3 threatening situation is, as demonstrated 
ti 
la 


nonavoidant 


eisset (1968), one manner in which con- 
2 one can be learned. He found that re- 
in training and instructions in how to 
ke relaxation in an anxiety-provoking sit- 


uation were as effective in reducing anxiety 
as the full desensitization procedure. 

In vivo experiences with the aversive stim- 
uli offer another form of nonavoidant con- 
ng. Dogs in a study by Selig- 
Geer (1968) learned coping 
y aversive situation 
hysically forced the 


tingency learni 
man, Maier, and 
behavior in an extremel 
after the experimenters p 
subjects to make the coping responses. Cooke 
(1966) and Garfield et al. (1967) demon- 
strated desensitization in humans when they 
were encouraged to respond nonavoidantly to 
anxiety-producing stimuli presented to them 


in vivo. 

Vicarious learning by 
engaged in nonavoidant 
stimuli is another contingen 
ence. Bandura, Grusec, and Menlove (1967) 
and Ritter (1968) demonstrated fear reduc- 
tion after subjects watched a peer approach 
the feared stimulus. 

A similar modeling process can be theorized 
to occur during the conventional desensitiza- 
tion process when subjects are instructed to 
imagine hierarchy scenes. As such, a subject 
may be, in effect, “modeling” after his own 
imagined behavior. Although there is no direct 

ion, it is inter- 


empirical support for this noti 
esting to note that the scenes typically 


imagined during desensitization are not of 
the feared stimulus per 56; but rather they are 
scenes in which the subject imagined himself 


behaving in some fashion in relation to the 
feared stimulus (e.g. stepping on a live snake 
or touching a snake). The subject imaginally 


observing himself successively approach the 


feared stimulus may Very well be a process 
parallel to the subject observing a live model 
engaged in these successive approach be- 


haviors. 


> observing a model 
behavior with feared 
cy learning experi- 


SUMMARY 
go both the therapeutic proce- 


A decade à he 
dure of systematic desensitization and its 


theoretical underpinnings were introduced. 
The procedure has been established as a 
very effective technique for fear and anxiety 
reduction; however, in light of investigations 
reported predominantly in the last two years, 
the original theoretical formulations appear 
to be inadequate 1n providing an understand- 
ing of the nature of the learning involved in 
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desensitization. Str 
evidence indicate 


Cognitive ele. 
imagination of fear- 


It appears that the cognitive and social 


peutic situation, which 
desensitization a 


ing 
icarious learning (via in- 
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IS “BEHAVIOR THERAPY” BEHAVIORISTIC? 
(AN ANALYSIS OF WOLPE’S PSYCHOTHERAPEUTIC METHODS) 


EDWIN A. LOCKE? 


University of Maryland 


Wolpe’s psychotherapeutic m 
whether or not they are, 
that Wolpe’s procedures 


The term “behavior therapy” 
by Skinner and Lindsley ? 
proach to Psychotherapy based on behavior- 
istic learning theory. The term has since been 
widely promulgated by Eysenck (1960, 1964) 
and is now used to describe a variety of 
techniques, all of which are (allegedly) based 
on behavioristic principles. 

Broadly speaking, “behaviorism is the doc- 
trine that the behavior of man and animal 
can be fully understood without the use of 
explanatory concepts referring to states or 
actions of consciousness, namely by studying 
only observable behavior [Locke, 1969, pp. 
1000-1001]." The basic premises of behav- 
lorism are (a) determinism: the doctrine that 
all of man's actions, thoughts, beliefs, etc., 
are ultimately determined by forces outside 
his control (typically, by environmental 
forces) ; (5) epiphenomenalism: the doctrine 
that conscious States (eg., ideas), if they 


exist at all, are merely by-products of physical 


was coined 
to describe an ap- 


* The author acknowled 
of William Claiborn of tj 
Mode of Sta: 


» Maryland 20742. 


À €ports IT and III. (Contract 
hington, D. C.: Office of Naval Re- 


t 
uen 
, : is subsequ? 

vidual’s subsequent actions or his 5 pection 


ideas; and (c) the rejection of hp 
as a scientific method. A behavioris «yerbal 
gator may ask individuals to give pe usé 
reports," but such reports may not ubject? 
to make inferences regarding the 5 
mental states or processes. 3 yioris™ 

The validity of the doctrine of ert 0i 
must be judged on the basis of whe one t° 
not the principles it espouses enable pelP 
understand man and to account for bei ] if 
to modify) his actions, Thus, for oxy asset 
a self-proclaimed behavior therapis ; ask 
that he can cure neurosis, it is cele ch 
(2) whether he can, in fact, achiev om 
cures; and (b) whether he amea eap 
applying the principles he claims to Jaims F 
ing and using and the methods he £ es P 
be using (ie. behavioristic princip 


ab 
methods). sa nally Todd 

The present study deals specifically E. 
the second question. The purpose ut 


: erape sor 
article is to analyze the psychoth ehavi? 


Í 
techniques of one proponent of point E 
theory—Joseph Wolpe—from the re} 


ar" n 
A P ds el 
view of whether or not his capu c one 
fact behavioristic, Wolpe's system v is the 

for analysis for two reasons: (@) aft on’ 
that 
ate ON gre 
they ye 
triot ops 
s o 


? Skinner (1964) has recently argued 


should not reject a priori the study of priv 
(images, Sensations, etc.) simply because 
Private. This is à radical departure from. 
haviorism ; however, in actual practice EM 
not use introspection and considers me 


ir B inin£ 
to be trivial and irrelevant in explain 
behavior, 
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tetic; ; 
ME enmphons and therapeutic proce- 
ture: and ES thoroughly in the litera- 
among A ) his procedures appear to be 
« ie most widely used by those 
Breger therapists who treat neurosis. 
ously eres McGaugh (1965) have previ- 
to the ee behavior therapy with respect 
Ingfulness cel of its assumptions, the mean- 
effectiveness its concepts and definitions, its 
tionship ae neurosis, and the rela- 
Oe tee oy and practice. The 
detail, Whi however, was not treated in 
SS exten te Breger and McGaugh's study 
focused sip in its coverage, in that it 
hree diffe many aspects of therapy and on 
study jg rent behavior models, the present 
a e intensive analysis of one aspect 
E approach to therapy. 
Deque eon of the efficacy of Wolpe's 
ut it E is too broad to be dealt with here, 
this E be granted, for the purposes of 
ertain ea that Wolpe is able to achieve 
lor Glee results with certain patients, 
is: How phobia remission. The question 
`o AE a he do it? 
be a pa ad the fact that Wolpe claims to 
Coreti aviorist, let us examine his formal 
ical system. 


W 
OLPR’ 
E's FORMAL THEORETICAL SysTEM 


theoretical concepts, for 
> “conditioned 


“reinforce- 


a x main 
* i ition „conditioned reflex,’ 

nt» ^ drive reduction," 
tists are all drawn from well-known behav- 
Ski © Such as Pavlov, Watson Hull, and 


D. ner, Y. P : 
3 Wolpe views behavior as determine! 


exte: 
Y nal stimulation in conjunction with 
that 


tts in 
E € bati the nervous system. He argues 
vha lent has . . . no choice in becoming 


Wo, € is [Wolpe & Lazarus, 1968, p. 16 P 


9lpe 

hey. further asserts: 

Qu ay "Ment of consciousness depends upon specific, 
cural activity- [Other 
that stimulation of 
urfaces of the 
ast experiences 
ess would 


c 
Cr hay 
nts have observed] - - + 


a specially placed observer ( 


o 
ese p 
iL eini events [Wolpe, 1958, P- 16].* 


by. Thi 
ag, Pe fingi 
nh nd " " $ 
gy ing that direct stimulation of a pe 
es not prove 


oc rson's 
uc h : 
es memory images do that 
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It is clear that Wolpe views conscious ex- 
perience as the by-product of brain activity 
that is, as an epiphenomenon. [ 

Consistent with his behaviorist premises, 

Lu 
Wolpe does not attempt to define neurosis as 
such but rather defines neurotic behavior, 
which is i 
any persistent habit of un 
by learning in a physiolog 
Anxiety is usually the centra 
havior, being invariably present i 
tions [Wolpe, 1938, p. 32]. 
His formal definition of anxiety is also purely 
behavioral: 


adaptive behavior acquired 
ically normal organism. 
1 constituent of this be- 
n the causal situa- 


the autonomic response pattern 
e characteristically part of the 
to noxious stimulation [Wolpe, 


By anxiety is meant 
or patterns that ari 
organism's response 
1958, p. 34]. 

The usual manifestations [of anxiety] are tachycar- 
dia, raised blood and pulse pressures, hyperpnea, pilo- 
erection, mydriasis, palmar hyperidrosis, and dryness 
of the mouth [Wolpe, 1958, p. 35]. 


According to Wolpe "anxiety responses" are 
acquired through a process of conditioning, 
for instance, the subject is hurt by some phys- 
ically noxious stimulus, and his subsequent 
“fear response" to that situation then gen- 
eralizes automatically to other (similar) sit- 
uations. Such responses can be removed by 
“conditioning” some other (positive) response 
to the same situation. All of this is asserted 


ntents of consciousness are produced 
brain activity over which the indi- 
control. For it has also been found 
dividual sets himself a purpose to 
s of events, one can record 
he electrical activity of his 
b). A behaviorist who took 
that ideas were the end 


all acts and co 
by antecedent 
vidual has no C! 
that when an in 
think of different type: 
systematic changes in t 
brain (Brown, 1970a, 1970 


the first finding as evidence r E 
point of a causal chain beginning with brain activity 


to be consistent, have to conclude from the 
that brain activity was the end point 
hain beginning with conscious aWare- 
ness! Since these two positions are mutually contra- 
dictory, there is obviously some flaw in the deductive 


process. more plausible position on the mind-brain 
relations shile consciousness presupposes a 


brain, th ot identical, that is, the former 
js not reduci! e latter. For an excellent. refuta- 
tion of the ductionism, see Blanshard, 
1964. PP- 311-340) Furthermore, in order to explain 
such act as purposive thought and deductive 
reasoning, it is necessary to grant that conscious ideas 

i both subsequent brain activity and subse- 
a recent view along these 


would, 
second finding 
of a causal cl 


hip is that w 
e two are n 
ble to th 
doctrine of re! 


al activity. (For 
Sperry, 1969.) 
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to be totally outside the control of the subject 
himself (cf. Wolpe & Lazarus, 1968, p. 16), 
although no Proof for this claim is given, 
Wolpe’s formal Position on introspection is 
Somewhat unclear but essentially behavioristic 
by default, While he sug 


S8gests that one may 
use introspection to dete 


fects of noxious stimulation (Wolpe, 1958, 

D- 34), nothing is said al i 

fying conscious States o 
Having 

theoretical 


istic, let us now examine the actual Procedures 
that he uses to treat patients, 


Wotpr’s PSYCHOTHERAPEUTIC METHODS 
This section is divid 
corresponding to the 
therapy: diagn 


ed into two major parts, 


two aspects of psycho- 
osis and treatment, 


Diagnosis 


emotional reacti Ri 

is typically given the Wil 
naire (Wolpe, 
patient to indi 
€ experiences 
he is easily 
feels * 


Worry": whether 
"discouraged": if he 
or "lonely": and whether he 
aking up his mind.” The pa. 
tient may also be asked to complete a “fear 
inventory,” which asks him to indicate what 
types of “things and experiences . . . cause 
fear or other unpleasant feelings [Wolpe & 


Lazarus, 1968, pp. 172-176].” 


he information gained from these ques- 
tions is Supplen 


nented with life history infor- 

mation, assigned homework, and interview 
data, all of which are aimed at revealing the 
content of the patients negative feelings and 
emotions (Wolpe & Lazarus, 1968, pp. 24ff). 
Tt should not be assumed from this that 
Wolpe merely uses the subjects’ “verbal re- 
Ports” as a shortcut method for measuring 


does 10! 


their physiological responses. Wolpe tell bim 


ask his patients (at this point) ae their 
how tense their muscles feel, or vi fee 
hands are Sweating, or if pedes j 
dry; he asks them if they feel ied » He 5 
"disturbed? or "upset? or “worrie experi?" 
clearly interested in the os This procedi 
tial aspect of their emotions. — H consi 
is an implicit admission that emo onses ( " 
of something more than bodily geo ó hi 
an exceptionally thorough gn : patient 
issue, see Arnold, 1960). Even if P measure 
physiological reactions (however notion f 
did accurately index the type of er would 1? 
(e.g., anxiety, anger, or guilt), they emotions 
reveal the content (objects) of tus as tha” 
that is, those situations and epe ta evel) 
arouse the emotions, Short of he perapi e 
single object in existence into the pysiolo8" 
office and measuring the patient’s Eminin E 
cal response, the only way of pet A 
patient's characteristic emotional re ately: the 
things is to ask him. To answer - ams 2 
patient must use introspection and of co 
both actions require the operation 
sciousness, ^ indicat’ 
Usually the patient is also asked t erience 
the relative amount of anxiety he exP dif 
in different situations or in respon pject™ 
ferent objects or persons using à ; Lagar? 
anxiety scale (SUD Scale; Wolpe pit is 
1968, p. 73). Occasionally, the su that P 
asked to rate the emotional acm he pas 
would expect to have to situations E i 
not actually experienced, To do thi thetic? 
told to imagine himself in the ho eren 
Situations and then to imagine (by in 1 imag 
his reaction to them, Expectation is 
ination are also actions of eonsciousne™. r 
Nor can it be assumed that Wolpe rend t 
ested simply in the patient's “verbal in ad 
as an end in itself, He mentions that ire p 
ministering the Willoughby questionna apli a 
therapist may need to discuss the BÓ re 
tions" of each question with the patient 68 
Scoring his answer (Wolpe & d that 
P. 26). The reason for this is obvious d of 
the therapist wants to know the — su 
the patient's response; he wants to mas „eant 
that the patient understands what 1s Pwel: 
by the question before recording his an o 
Wolpe reports a problem he had treating 


ost 
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Patient 
€ felt is had not considered the anxiety 
e Worth n imagining a certain situation to 
bn)» m report [Wolpe & Lazarus, 1968, 
willy unti the rapy could not proceed success- 
(es What ea Patient revealed her actual feel- 
? an end im patient says is used as a means 
n content e end being the identification of 
‘onal rea, and intensity of his negative emo 
En oe 
H88nostic ua from the foregoing that Wolpe's 
TOspectin ethods all depend on the patient 
i Vealing ed (and possibly imagining) an 
ersity peur (or expected) content an 
Eun. The is internal feeling states (emo- 
ited tea, Procedures could hardly be 
M all pe ioristic in view of the fact that 
“Periences rtain to the patient’s conscious 
NE eS. 
Wo pe nors, the implicit premise behind 
neal his cedure of asking the patient to 
1 ona] ies eoe responses is that these 
hie Pini: aas are the patient's problem. 
ay "unada is that they are the cause of 
ng dance oo behavior,” for instance, his 
th Objective. real life, of situations that entail 
= Dy is threat. As is demonstrated below; 
l rela aimed at modifying these feelings 
ted beliefs) in order that the pa- 


tints acti 
ctions may change. 


te 
ment 


Wolpe asserts 


Pre 

CO. sye 

th; nditions of therapy. » 
tates that the 


s 
3 ti Tim sful treatment necessi 
D Actionis e be assured that his «unpleasant 
the th arg reversible, that is, curable; 
vice Srapist may find it desirable to offer 
», Pe ema om] support, encouragement 
4 iP T spo [Wolpe & Lazarus, 1968, P- 
"us he therapist must also establish “a 
peces and make the patient 
0 y accepted as a fellow human being 
: oe & Lazarus, 1968, P- 281." With re- 
Writ O children, Wolpe's colleague, Lazarus, 


es: « 
“ndere “A warm rapport . - - and a close 
ulfillments and 


ide 
[italics added; 


«A28; 

i. carus & Abramovitz, 1964, p. 461]." The 

ü behind such “supportive” 
ed is that they Will influence the patient’s 

th, Tous attitudes (toward the therapist and 
tapy, etc.) and that these attitudes will in 


TS: : h : 
hg diu of his wish-f 
cations are essential 


Dlici 5 
ac Cit assumption 


atient's actions, for instance, 
n with the therapist. 
Frequently, patients’ “misconceptions” must 
be corrected before therapy can proceed suc- 
cessfully. Examples of mistaken beliefs that 
Wolpe has had to correct in his patients in- 
clude the following: pine tree bark secretes 
poison (Wolpe, 1958, p. 200); masturbation 
is harmful (Wolpe & Lazarus, 1968, p. 131); 
direct genital contact is the only way to 
achieve orgasmic satisfaction (Wolpe & La- 
zarus, 1968, p. 105); and chest pains jnevit- 
ably signify heart disease (Wolpe & Lazarus, 
1968, p. 19). In order to treat one patient who 
suffered from intense irrational jealousy to- 
ward his wife, Wolpe had to “establish a clear 
line of demarcation between situations that 
would give legitimate grounds for jealousy and 
those in relation to which jealousy would be 
abnormal [Wolpe, 1958, p. 164]." To con- 
vince another patient that she was not men- 
tally abnormal: 
therapist 
emonstr: 


turn affect the p 
lead to his cooperatio 


had to employ every resource 
ation and the full force of his 
r that she was in truth a 
isabled by neurotic 
1968, pp. 131- 


The behavior 

of reason and d 

own prestige to persuade he! 
ho had become d 


normal person W 
conditionings . - + [Wolpe & Lazarus, 


132]. 
Other patients are 
ity of their religiou 


convinced of the irrational- 
s beliefs by doing assigne 
reading on the topic (Wolpe, 1969, p. 20)- 
The technique of persuasion is not confined 
simply to the patient's mistaken beliefs; it is 
also applied to his moral. values. In order to 
get his patients to become more self-assertive, 
Wolpe frequently has to first convince them 
that abject self-sacrifice is self-destructive and 
that healthy self-assertion is morally proper. 


Wolpe writes: 
ts with 
may 


listic training logi- 
be needed to convince them of 
rights and the 


unavoidability 
of those who 
1968, p. 44]- 


One patient, for example, believed that she 


had no right to expect her aunt to stop taking 


advantage of her. Wolpe said to her: “But all 
g is justice. Is justice too much 


you're expectin 
to expect?” 

s Wolpe, J- The Case of Mrs. Schmidt. Counselor 
Recordings, Nashville, Tennessee, 1965. (Transcribed 


tape recording 
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Patients may also be provided With prac- 
tical knowledge (such ag how to take a job 
interview, or how to answer back to a husband 
or father, or how to deal with an irrational 
boss; see Wolpe & Lazarus, 1968, pp. 48-53), 
as well as practice in applying that knowledge. 
To accomplish the latter, 
i the pro- 
tagonists in the patient’s real-life situation. 
The procedures described above—which are 


therapy—consist in 


changing the Patients mind) Tt is 
publicized as a behavioristic technique, 
SPECIFIC THERAPEUTIC Pro 


Only Wolpe’s three Major techniques are 
discussed here: Self-assertion, Systematic de- 
sensitization, and sexual arousal 


CEDURES 


Self-Assertion 


Tf the patient is diagnosed as being anxious 
and passive in the pre: 
he is urged to become 
his dealings with others: 


The therapist increases the motivation by Pointing 
emptiness of the patient's 
how his fearful pattern: 


Observe that the pati 


iate self-assertive behavior throug’ 
Persuasion, 


it is not, a 
judgment, dis- 
thinking, 

theory, Wolpe implies that the 
assertion are due solely to its 


his formal 
benefits of self-; 


A. LOCKE 


imilar Pt 
Most assertive trainees seem to follow E (a 
tern of evolution, First, there is an PE T 
ness of their non-assertiveness and its intel! 
Percussions. This jg followed by an jts pos val 
appreciation of assertive behavior andi ectualil | 
effects. Increasing distaste for their pup. seem 10 
and resentment towards the forces bet E 
maintaining or reinforcing the nonasserti ts a s 
usually clumsy, men the Pre 
assertive responses, If positive effects i avior ia 
ility of engaging in more assertive isfactions ied 
Occasionally, as emotional sali omin? 


and spread, previously timid and E. id | 
individuals lend to overassert themselves inant W nt 
tually] . . | The patient learns to be pes e 
out being dominating | . . Finally, as of i or 
becomes aware of his growing mastery Fd e 
sonal situations, there develops a genuine chi p 
indifference to minor slights, petty mà 1 


More adequate 
from other indivi i 
negative Self-perceptions in a way thet thee 
Performance of the new behavior [itali 

Wolpe & Lazarus, 1968, pp. 47-48]. » pe 


" 55. 
Note that the Patient’s "awareness, 


n 
“intellect,” his “self-concept,” and P the 
tions are aj granted a role not es subs | 
initiation but in the maintenance ex ction 
quent modification of his self-assertive ^ tha” 
early, a great deal more is involve 
mere inhibition of anxiety 
Systematic Desensitization whos? i 
This Procedure is used for patients [U n 
fears are of noninterpersonal situations.“ of | 
whom 


rial’ sy 
assertive training is not approP' plis! 


2 iot eS j5 
adequate, The Patient and therapist i a | 
anxiety or fear hierarchies in the struct 
sessions, and treatment begins by in be que 


the patient in 


i I Jes 
muscle relaxation. (T Je 
tion of whether 


vent's muse ii 
it is only the patient's Mpal 


is G 
that are relaxed by this potu oF 
with below), Wolpe tells the pa eliberat! 
liberately contract and then to 


y ar 
relax specific muscle groups, Re oy 
the body can be relaxed at will (V this poc 
The implicit premise of purpo? 
the patient's pups x» 
affect the actions of his body eee to ree 

nce the patient has been pecan in p 
€ is instructed to imagine Ww (one 9, 
mind) the Object or situation x the 1 
his fear hierarchies that produce: 


amour : 
at i. e ORY. This procedure implies 
fluence e atient’s conscious purposes can in- 
id " actions of his mind (his ideas)! 
training ab. and the premise underlying the 
Variance muscle relaxation are not only at 
lo the eons but are directly contradictory 
| *serts ee of epiphenomenalism, which 
are the A» conscious ideas (e.g. purposes) 
No indepe points of a causal chain and exert 
iig, cadent influence on the individual's 
3 Syst ser 
atient į en desensitization proceeds, the 
e nxiety ie the therapist of the amount 
Wires no Is experiencing, a procedure that 
Should h e patient to introspect. Again it 
‘Imply e stressed that Wolpe is interested not 
p ie patient's verbal report but in 
Se Wo] 1 of that report to his real feelings 
With De & Lazarus, 1968, p. 71)- 
respect to one patient, Wolpe writes: 
e patient 


tis 

has „Wor 

mE Sigma marking that even though the patie 

iia o paing his finger] at his disposal with 

GE has E [emotional] disturbance, the fact 
70 mean ot done so during a [an imagined] scene 

or i: Droves that it has not disturbed him 

ARA dr eu» a rare patient who makes ust of the 
29]. ly mildly disturbed [Wolpe, 1964, PP- 


t Wolpe told 
feelings [pP- 
he viewed her 
e and distin- 
or else 
]ling her 


Wh 
her le. treating another patien 
(an pr “express your 
ling, Footnote 5]. Clearly, 
wtishaby as something separat 
Pete aS from her overt behavior, 
? exp Ould have been no sense in te 
Tess them. 

qure Mh Wolpe is obviously i 
Je BRS his patient's verbaliza 
X » eni says (along with oth 
NS E nonverbal actions, his pas 
Werin al expressions) is used as a basis for 
S alps? his mental states. Wolpe uses what 
Wh ety observable as a basis for inferring 
Cannot be directly observed. 


nterested in 
tions. What 
er evidence, 
t history; 


Se. 
d: 
ual 4 rousal 


Wolpe 
Xe e's method of treating patients with 
Problems (premature ejaculation, etc.) 


8 
9 re 
n Sh to the doctrine of epiphenomenalism 
qe ds e that is self-contradictory, see Locke, 
St ide es not require that one advocate the view 
tt UE can exist independently of a physical brain 
at one believe in ghosts (see Footnote 4). 


P 
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consists mainly of (a) suggesting new sexual 
procedures to the patient and (5) persuading 
the patient to abandon any irrational or im- 
possible demands that he (or she) makes of 
himself. For example, a patient who is very 
anxious about his sexual performance might 
be told that he should go about sex with the 
attitude that *He has no duty at any stage 
to reach any criterion of performance [Wolpe, 
1958, p. 131].” Pains are taken to make sure 
the partner is "understanding," "tolerant," 
and “affectionate.” 

The mechanism that Wolpe holds responsi- 
ble for virtually all of the improvements made 
by his patients, regardless of the particular 
method used or the nature of the problem, is 
that of “reciprocal inhibition." Let us now 
examine this concept in some detail. 


The Concept of Reciprocal Inhibition 


As coined by Sherrington in 1947, the term 
reciprocal inhibition referred to a specific 
neurophysiological phenomenon, namely, 
the inhibition of one spinal reflex by another, such 
as occurs when stimulation of an ipsilateral afferent 

fa vastocrureus muscle con- 


nerve causes relaxation 0. s 
tracting to a contralateral stimulus [Wolpe, 1958, 


p. 29]. 

In other words, reci 

when one type of refl 

another type of reflex. 
Wolpe, however, argues W 

term that 


procal inhibition occurs 
ex inhibits the action of 


ith respect to this 


be expanded to encompass all situations 


licitation of one response appears to 
t in the strength of evocation 
olpe, 1958, p. 29]. 


Its use may 
in which the e 
bring about a decremen 
of a simultaneous response [W 
According to this arbitrarily broadened defini- 
tion, the following types of events would pre- 
sumably be examples of “reciprocal inhibi- 
Jf a man is self-assertive in a given 
imultaneously be pas- 


he will not s! 
at situation; if a man feels self- 


confident, he will not at the same time feel 
inferior; if a man gets pleasure from sex, he 
will not simultaneously fear it; if a man is 
relaxed in the company of others, he will not 
at the same time be tense; if a man turns left 
at a street corner, he will not turn right at the 
same time; if a man says "goodbye" to a 
friend, he will not simultaneously say “hello,” 


tion": 
situation, 
sive in th 


pde SF 


E 
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Lest this last example appear too farfetched, 
consider the following quote from Wolpe: “ac- 
companying the articulation of any word there 
is ordinarily an automatic inhibition of all 
simultaneous tendencies to Pronounce other 
words [Wolpe, 1960, p. 91].” 

"These examples assert, in essence, that one 
cannot do (or feel) two incompatible things at 
the same time, As Wolpe defines the term 
reciprocal inhibition, therefore, i; represents 
little more than a restatement (in a Narrower 
context) of q fundamental metaphysical prin- 
ciple, which is identified in logic as the law of 
contradiction, This law states that a thing 

f and not itself (A and 
e time and in the same 


ple was identified by Aris- 
totle over 2,000 years ago. 


Wolpe's assertion that reciproca] inhibition 
is the technique by Which his 


rtainly true, it is not very 

- The relevant scien- 
w to get patients to remain 
ns that characteristica]ly upset 


Counts that he of i 
lo remain "calm 
Schmidt” (see Footnote 5 3 
training i 

Wolpe told her to raise 
affected her "state of ca 


r if her muscles were 


More Tecently, Wilkins 
gued on the basis of further evidence that 
relaxation may not be necessary at all for 


“desensitization” to be effective. Wilkins 
claims that it is sufficient to simply have the 


(1971) has ar- 


‘| 
ducing 
patient repeatedly imagine fear T aa will | 
Scenes and to expect that the proce 
help him. + Wolpe” 
This author would suggest mer can be 
actual psychotherapeutic procedure (a) com 
subsumed under five main apertos by 
vincing the patient that he can be o the pe 
(behavior) therapy; (5) a fears 
tient’s irrational beliefs, values, "tnowledt? 
(c) providing the patient with new ersuadi!^ | 
and/or values; (d) helping and pee an 
the patient to act on this new know lax in thel 
(e) teaching the patient how to re situ 
presence of formerly frightening situation’ 
(relaxation training, imagining explicit 
etc.). These procedures dao has the 
or implicitly, that (a) the patien mind | 
ability to introspect; (b) he has a E 
pable of grasping 
uestions, 
e, that he can think); (c) ip | 
ability to purposefully control ae 
his own mind and his overt behavio B 
Furthermore, it should not be assu to " d À 
the particular techniques Wolpe uses | only 
ify a patient's irrational emotions are s the 
or even the most desirable) aget p 
Could be used to accomplish this p 
mentha] (1969), for example, has T «thia 
methods Whereby the patient learns such E 
his way out? of neurotic emotions, sugg"! 
fear of disapproval, Wilkins (1971) $ 
other possible methods. 


stan” is 
concepts, unde: nt 


; unm 
and following logical ere E. 


tions 
d that 
noc" 


Conctuston peut 1 
The claim that Wolpe's psychother t iple 
methods are based on behavioristic resp 
cannot be supported in any importan the pg, 
Virtually everything that is done in m mof 
chotherapy sessions contradicts one cal Tin 
basic behavioristic premise. Nor "T endé 
Wolpe’s procedures behavioristic be eptio” a 
on the grounds that his implicit connai u a 
behaviorism is simply broader than ean A 
by other behaviorists and is thus ei no 
include such activities as "feeling or erst d 
behavior,” “thinking behavior,” unm n ug 
ing behavior," “imagining behavior Tal e 
laxing behavior.” It must be stresset op hic? | 
haviorism as a school and as a phus p 
Movement was a revolt against the dies onb) 
Consciousness; a strict behaviorist stud 


T— 


ec NS P 
Or Ontradiction between behavior 


|] E 
bw 8 (Where the “reinforcement” 
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Observ, 
icit des behavior, To confess that one’s im- 
Study of co On of behaviorism includes the 
'S not a i ME is to confess that one 
ehaviorist. To broaden the term this 


Much 

Makes j 

sary ; CS it both meaningless and unnec- 
y. S 


There ; 
Dect e One minor (and nonessential) re- 
beh viori hich Wolpe’s method does resemble 
e Son E procedures. He places considerable 
' example E in systematic desensitization, 
d When €, scene presentation must occur 
quur bin the patient is relaxed, that is, the 
py) ie is presented only (and imme- 
i rocedure the patient is calm. In this sense 
h Mere d. resemble Pavlovian condition- 
m resent, the unconditioned stimulus must 
he ed in close temporal proximity with 


~~ Conditi : 
lon: litioned stimulus) and operant condi- 
must fol- 


nse). 


mmeg; 
Ih this *diately upon the desired respo 
also 


h res 

en b jai, however, the methodology 
mu hist Procedures used by (mental) asso- 
abre s o Study processes such as memory 
Nother). © one nonsense syllable right after 


ist the- 


ET 
Nope d the practice of behavior therapy by 
Pords cannot be resolved by asserting that 
havi are “conditioned stimuli” that control 
Mus, por just like any other conditioned stim- 
Diti instance, as with à bell eliciting sal- 
“they ! in a dog, While word sounds (or any 
ome pound) may under certain conditions 
liza O serve as signals that elicit automa- 
à lay, *sponses, the effects of words qua units 
a ee are far more complex. Words in 
P Ce ler sense are symbols that stand for 
Nus PES; Concepts represent an unlimited 
Vents of concrete objects, situations, oT 
gum (Rand, 1966). The effects of words or 
z ents on an individual depend upon such 


ae ES if one were to grant (which this writer 
Tbe t) that definitions are arbitrary, it is in- 
to ally upon anyone who radically and funda- 
bupj; anges the definition of à concept he uses 
tha, to d and openly announce this fact in order 
hapy, f ead his readers. To use the term behavior 
met one or example, without acknowledging the fact 
(dS by does not mean what everyone thinks he 
ned j it, is to risk the danger that behaviorism 
Jn. Bo rier eco, way) will be given credit 
in’ Well be ù it does not deserve. This, of course 

th be the intent of those who misuse the term 


15 manner. 


factors as how and if they are understood, how 
their meaning is appraised or evaluated, and 
the individual's decision to act. These mental 
operations are not mechanically conditioned 
from without; man has the capacity to direct 
(within limits) the operations of his own con- 
sciousness. He can initiate the acts of thinking 
and evaluation; he can set himself purposes 
and discover the means to achieve them. The 
understanding of enormously complex actions 
such as thinking and reasoning is not aided 
by arbitrarily attaching the label “conditioned 
stimuli” to the tools (words) man uses to 
accomplish these tasks. 

Since the deviation of Wolpe’s methods 
from genuine behavioristic techniques is so 
obvious and blatant, it is worth inquiring as 


to its causes. 

The answe 
does not actuall 
seems obvious: 
would be impossi 


r to the question of why Wolpe 
y use behaviorist techniques 
if he were to use them, it 
ble either to diagnose or to 
treat neurosis, that is, the procedures would 
not work. Such techniques might be applicable 
to the training of animals (and to an extent, 
to modifying trivial aspects of the behavior 
of autistic children and schizophrenics), but 
no therapist could adequately treat a neurotic 
patient without talking to him, a procedure 
that presupposes that the patient is conscious, 
that he understands words, that he can think 
and introspect.* 

The issue of why, given that he does not 
actually use behavioristic methods, Wolpe still 
claims to do so, is more complex, but a plausi- 
ble reason can be offered. 

For decades psychologists have had an in- 
feriority complex with respect to the physical 
sciences because an important aspect of their 
subject matter—man's mind—could not be 
studied by methods that other sciences had 
declared to be “scientific.” Thus, psychologists 
have been accused (especially by each other) 
of being “mystics” rather than scientists, of 


s One can argue that even with respect to animals 
and autistic children, one needs to posit some degree 
oF conscios awareness on the part of the organism 
in order to explain why the so-called “reinforcer” 
Descriptive behaviorism conveniently 

the necessity for such explanations by 
reinforcement in terms of its effects and 
interest in why reinforcers affect 
see Skinner, 1964, p. 104). 


any 


denying 
behavior (e£4 
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studying “the ghost in the machine,” of be- 
lieving in “animism” and “primitive supersti- 
tions,” and of “taking us back to the dark 
ages.” In response to these Criticisms many 
have attempted either (a) to reduce concepts 
referring to consciousness to concepts ap- 
Plicable to the Physical sciences and thus 
study consciousness “in disguise,” as it were; 


with the study of con- 


ogists have avoided 
ing or admitting they are stud 


ness due to fear of being labele: 
(see, on this Point, the comme 


study- 
ying conscious- 
d “unscientific” 
nts by Rychlak, 


uation is that the ac- 
“unscientific” 


or measured with a r 
lyzed), would attempt to develop new methods 
that were appropriate for the study of this 
phenomenon, The failure to recognize this 
fact, in this writer’s opinion, has not only 
produced enormous conceptual confusion in 
Psychology, it has tremendously retarded the 
scientific study of consciousness. 

In Closing, there are two additional points 
that must be made with reference to behavior 
therapy: (a The accusations made against 
Wolpe Concerning the breach between theory 


and practice are not equally applicable to 
all advocates of behaviorism (e.g., Skin- 


E 
nts rege} 
ner ^); and (b) the author's - inb 
ing the efficacy of Wolpe's techniq D 


u 
3 B S sho! 
ing neurotics to remove symptom: fb! 


study. 


jer, dod 
? Conventional behaviorists, howe ve onset [ui 
On the fact that their subjects are © han i 
Footnote 8) but in a more subtle way 
case with Wolpe. 
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DEVELOPMENT OF THE HUMOR RESPONSE: 
A REVIEW OF THE LITERATURE! 


PAUL E. McGHEE 2 


University of Toronto 


and a developmental theory 
While most of the 


In an early summar 


Y of research on humor, 
Piddington 


noted that Philosophers 


argued that theori 
should go beyond 
based on empirical 
"there are few mor 


to our limited understanding of children's 
humor by summarizing both the theoretical 


Positions that have been advanced and the 
empirical researc 


* This manuscript was prepared 
toral year with Daniel E. Berlyne 
of Toronto, s 
Education, an 


during a postdoc- 
at the University 
upported by Department of Health, 
d Welfare Grant HD 34940-01 from 
the National Institute of Child Health and Human 
Development, 

* Requests for reprints should be sent to Paul E. 
McGhee, who i 


5 DOW at Debartment of Psychology, 
State University of New 


York at Albany, 1400 Wash- 
ington Avenue, Alb 


any, New York 12203. 


ychoanalytic and gestalt theories are discussed, 


humor is advanced by the writer. 
h age differences in humor respon- 
ectual development and children's 


«41 method, 

Attention is also drawn to a. ct 
logical problems and important thor ug 
and empirical issues that confront à 
investigation of children's humor. 

THEORETICAL VIEWS (heorie f 
Only contemporary psychological sper, 
of humor are summarized here, bes a 
attention given to theoretical positio A p 
Specific developmental implications. ad PY | 
cussion of additional philosophical "oun d 
chological theories of humor may be 4) ny 
Piddington (1933), Flugel (1954); 
Berlyne (1969). 


Psychoanalytic Theory {0 


nor X 
The most influential theory of hum yet à 
date continues to be that advanced by scit! l 
in Wit and Its Relation to the pee "n 

(1916). Freud distinguished beme ex, | 
comic, wit, and humor, emphasizing fi Pu | 
nitive, conative, and affective a pri | 
ludicrous, respectively. In each case, t play, 

ciple of “economy” of psychic Enere r ) 
the central role in producing the expe" j | 


Of satisfaction or pleasure. A bet. ue 
thought? or representation character!? ye $ 
comic, Thus, comical situations inve sé 
transition from a set to take som is 
ously to a Perception that it is trivia NI 
Consequentia], The consequent saving on id 
ergy is discharged as laughter. Freud p. z 
ered wit to operate through plays on ? 
9r ideas, with satisfaction generated 
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di 
Savin g 

Pormally 

Urge 


in AH ikea 3 
thie In “tendential” wit, 
Social] S find a tem nace or aggressive ideas 
C sanctioned. Y release through the 
(th. UP instinct joke, and the heretofore 
the form op. ual energies find expression in 
measure Aye laughter. In “harmless” wit, 

pal cle bn from a shedding of our 

Whi Ogical n to maintain a moral, rational, 

t lle the oy node of cognitive functioning. 

lig} © more ee expression of affect is likely 
thilg accompani erate here, a pleasurable de- 
ike indul ins such regressions to a more 

ig p ets and dia in absurd or nonsensical 
' that 22s Finally, the “saving In 
Um ducing iip aa humor is achieved 
ne Would die seriousness of à situation 
ina ive emoti herwise produce some strong 
NA ual er By denying the threat, the 
"ipi lon at e better able to cope with the 
thy © played and, While the economy prin- 
tho! ing, Ply E important role in Freud's 
it. E. alytic = (1954) noted that later psy- 
Chey i have not widely adopted 
ize the fact been inclined rather to em- 
hu antile M of discharge, the regression 
t5 Nor gen minipotences and the fact that in 
nitro] erally the discharge is under the 

White pe’ eo... (Delle 
Wri binis d did not devote spe : 
TN Ts lioe in children, later psychoanalytic 
Br opmental. extended his views along de- 
weed th dimensions. Wolfenstein (1954) 
vh age at while the devices utilized vary 
E the nat basic motive in humor initiated 
sg lenta ild is the overcoming of distress and 
M X release from frustration. Every 
D Ces eriences frustration and anxiety from 
ne Des Key: as the making of impossible 
3 S of nd the overwhelming power and ig- 
S an adults. Much of the children’s joking 
Dossi attempt to free themselves from im- 
"a noU and wishes by picturing 
Mtas ulfillment as ridiculous. The use of 
sinea Z İs important throughout this process, 


aC j Rm 
i; °° the child, in all of his play and joking, 
hile knowing 


asi 
iE 


cial atten- 


j Pret $ - z 
tig "nid something is real wh! 
Wolt 
ol 5 , 
Ye] fenstein asserted that as children de- 
of the 


op 
r they become increasingly aware 


Strict 
Ee. dd placed on sexual and hostile im- 
. Along with simply developing inner 
the joke 


estraj the chil to use 
ints, ild learns 


as a means of expressing these impulses. Gen- 
erally by the age of 6, the child sees the 
need for a joke facade; that is, a mechanism 
that creates an ambiguity between respectable 
and indecent aspects of the joke, thus allow- 
ing it to get past inner and outer censors and 
provide vicarious enjoyme 


Finally, Wolfenstein 
the role of mastery jn children’s humor by 


stressing that an important distinction must 
be drawn initially between whether the ab- 
surdity or nonsense of a joke is set forth by 
an adult or by the child himself: “While they 
can easily break away from the demands of 
reason on their own initiative and enjoy 
fantasy or nonsense, it is not always clear to 
them when others, especially their elders, 


make this shift [p- 196].” Only when the child 


has achieved a higher level of mastery and 
understanding of his environment does he be- 


gin to respond to the humor of adults’ jokes. 
The earliness at which this occurs, according 
to Wolfenstein (1953), depends both on the 
child’s intelligence level and his interest in 
jokes. 

Kris (1938) also emphasized the importance 
of mastery, arguing that to experience the 


comic: 


nt. 
(1954) emphasized 


plete control over that 
ovement on the 


ill seem funny to 2 child 


n is com) 


A preliminary conditio 
An absurd m 


function in question. 


part of another person W. 
only when it has itself mastered the movement. At a 


later stage of development, it will laugh at à mistake 
in thinking only when its own powers of thought 


are firmly established [p. 83]. 


(1957) argued that the child first 
situations when he begins to 
y body movements. When he 
o other children in 


Grotjahn 
discovers comic 


master and enjo. 
] superior t 
, he is likely to see their mistakes 


as funny. Eastman's (1921) 
tains the germ of the sense 
ent with the position of 
Kris and Grotjahn, since the very young 
child's play consists primarily of motor be- 
havior. While laughter appears to accompany 
a sheer joy of bodily movement in most in- 
fants, the mastery of motor skills through 

; probably plays an important role in the 
kind of laughter described by 
Grotjahn and Kris. Levine (1967) also sug- 
gested that laughter and fun are inseparable 


or weaknesses 
view that play con 
of humor is consist 


superiority 
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p to age 12, two-thirds of 
ype. 
1mor is funny, according 
because the Child is 
a fantasy situation. 
laughing because he 
n Or because a cartoon 
Tun over by 
the healthy child) only 
at each will get Up again. 
the child might be 
frightened, but the humor 
would likely be lost. While Speech mistakes 
B errors (eg, slipping on a 
banana peel) are funny in their own right to 
the healthy child, they may become the 
Source of crue] and derisive laughter in the 
child who feels unloved or unsure of himself, 


(1965), 
aware that it js only 


hether a child is 
knocked daddy dow: 


a steam 


Gestalt Theory 


The only additional “classical” psychologi- 
cal 


may 
nge the Meaning of both the whole 
a joke, a 


ar- 

change in the anticipated meaning Or] 
ticular elements leads to a PR pu whole. 
ing and change of interpretation E e sudden 
In a humorous experience, then, xm the 1€ 
hess and unexpectedness with d Maitf . 
Structuring occurs is the critical we in 
also felt that “objectivity” is impor r can 
humor situation, That is, the gem. with 
maintained only if we do not iom i 
the figures involved, if salient en Howeve 
our own do not come into play. prupt 2 
since an objective attitude and * rođuctiv 
Structurings also characterize ue typi | 
thinking, other factors must pei unt | 
humor. The differentiating factor vhi 
in the element of the deiode Aa w. 
found in the humorous experience, 19 
is not present in reasoning (Mine unora 
721” While the logic behind the the lof 
situation is unique to that dou eA wit 
behind productive thinking is consis 
the individual's general experience. t 

Bateson (1969) Stressed the i whee 
reversal of figure and ground occu cture o j 
we “get the point” of a joke. The d e 
the joke draws our attention to cer qat P 
ments while others are deemphasize, 4 j ie 
form a background or setting for grou 
When the punch line occurs, the pac prou » 
material is suddenly and unexpectedly on Fry 
to the center of attention. Like Hb. A 
(1963) suggested that the pete te 
rived from humor is due to the une 
resolution of a series of paradoxes: 
During the unfolding of humor, one 1 when 3 
confronted by an explicit-implicit reversa la r3 
punch line is delivered :.. but the rever oT a, 
the unique effect of forcing upon the. noe 159 
ticipants an internal redefining of reality 


] ply 
sow highs | 

Koestler (1964) has advanced a visite id 

Consistent with the above, althouck e aA 

not claim any gestalt leanings. He m ant 

that various types of creativity, e r 
scientific discovery are based on a px 
istic menta] Process that he calls ;tuati? 
tion”; that is, “the perceiving of a S! abit 
Oradea... 1 two self-consistent but ups M 
ally incompatible frames of reference iati? 
Ip. 35].» With respect to humor, bisoC ug? 
Causes an abrupt transfer of the train of rere"! 
from one matrix to another governed by a jon” 

logic or “rule of the game.” But certain emo 


pic! 


, 


a 
at 
th s 


D 
adem, 


A 
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OWin 

E to A 

hi their greater i 3 

9t follow ae greater inertia and persistence, Can- 

Teason, eae jumps of thought; discarded 
Y are worked off along channels of 


t resi 
Istance in laughter [p. 951. 


None 

has Eie dhe four theorists discussed above 
Work devel ed to extend the theoretical frame- 
fiw? Since s to children's humor. How- 
le eory of a E a primarily cogni- 
EUN umor, the role of children's 

e Ive functioning in the opera- 

S an de mechanisms described be- 
hi Suggested ous area for investigation. If, 
ibit a an by Maier, humor situations €x- 
Fneralize re kind of logic that does not 
n o tes Other situations, it may be fruit- 
psica] the the role of acquisition of 
Ension and ang m children's humor compre- 

appreciation. 


Ap, 
OUusal Theory 


. Mo 

E e gently, two investigators interested 
havigo, ore global problems of exploratory 
Pression (Berlyne, 1960, 1969) and the ex- 
Qs 186 positive and negative affect (Tom- 
th Or ied have offered explanations of 
Pe hum sed on arousal factors operative in 
Resteg Or situation. Berlyne (1960) sug- 
at when a humor stimulus is en- 


tere 
ite in d, an orientation reaction occurs, Tep- 
he p 28 an effort to understand the point of 
urther maintained 


(logo "Or depicted. He f 
Con ain that “humorous 
Arousa factors that can be expecte 
Decte and other factors that can b 
to lower arousal or else keep it w 


Oder 
fop ate bounds |p. 806 ]." The critical issue 
becomes the 


) are capable 


and (5) dis- 
1sal 


situations always 
d to raise 
e ex- 
ithin 


: hi 
Mentis? iype of theory, then, 
OF ea of factors that (4 
tin ich and lowering arousal; 

Sh between the nature of the arot 


To 
Mes Operating in humor, fear, startle, ex- 
With ory, problem solving, etc. responses. 
Ber y Tespect to the former consideration, 
Dhi wh rightfully pointed out that earlier 
he oe peal theorists erred in emphasizing 
xelusr two arousal-influencing factors to the 
With lon. of all others, providing for theories 
aint only limited explanatory power. He 
n A ained that “all, or virtually all, of the 
lin rs that govern arousal can play a part n 
nor [p. 807]-” The arousal-raising aspects 
à humor situation, according to "Berlyne, 


may be influenced either by the expression of 
some “emotional tendency” (sex, aggression, 
etc.), by the presence of collative 
variables in the stimulus situation (novelty, 
surprise, incongruity, etc.—all presumed to 
affect arousal through conflict), or by some 
combination of the two. The main ways in 
which arousal reduction may be brought 
about include gratification of the emotional 
tendency or motive or the achievement of an 
understanding of the "point" of the humor 
depicted. Of course, arousal reduction might 


also be achieved by more extrinsic means, 
such as shifting or withdrawal of attention. 

Berlyne suggested that an “arousal jag" 
many humor situations, while 
in arousal operate in 


superiority, 


operates in 
moderate increases 
others. The arousal jag refers to 
in arousal that is associated with a reward 
actively sought out, reported as 
*pleasant" or “enjoyable” by humor subjects, and 
capable of reinforcing instrumental responses that 
regularly bring it about [p. 806]. 

e of empirical evidence 
pointing to the re- 


a rise 
value, so that it is 


He cited a wide rang 
(Berlyne, 1960, 1967), 
ward value of both arousal increases fol- 
lowed by decreases, and moderate increases 
in arousal, suggesting that these may operate 
either independently or together in humor 
situations. With respect to the second issue 
specified earlier, however, no explanation has 
yet been offered for the differential nature of 
the operation of the arousal jag or “boost” 
in humorous and nonhumorous situations. 
Until such a distinction has been accom- 
plished, this arousal theory must be con- 
sidered an incomplete explanation of humor. 

Tomkins (1962) has advanced a general 
theory of the innate activators of positive 
and negative affect. The critical differentiating 
nsion here is the density of neural firing 
that is, “the product of the 
intensity times the number of neural firings 
per unit time [p- 251] While both positive 
and negative affects (e.g., interest, fear, and 
startle, which differ only in rate of stimula- 
tion increase) may be produced by sudden 
increases in stimulation, Tompkins maintained 
that only positive affects (e.g., laughter and 
) are activated by stimulation. decrease 
dered laughter to be a more intense 
e smile, and: 


dimer 
or stimulation; 


O are 
He consi 
form of th 
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As such it is activated in the same way as the smile 
except that the general density of stimulation which 
is suddenly reduced begins at a higher level in the 
case of laughter compared with the smile [p. 370]. 


The quality of laughter or smile produced, 
according to this view, depends on the nature 
of the originally more intense stimulation, 
In any Case, a certain minimal Steepness of 
the gradient of reduction must be reached 
before smiling or laughing may Occur, with 
the suddenness of the drop and the original 
intensity level interacting to determine the 
nature of the response, 

While the language used b 


y Berlyne (1960, 
1969) and Tomkins 


(1962) differs, Berlyne's 
use of the concept of arousal increase and 
decrease is comparable to Tomkins’ use of 
the concept of increase and 
density of stimul 


It is not necessary 
that moderate rises 
ive to a humor re- 
consequent arousal 
nt since the arousal 
e in similar fashion 


Sponse apart from any 

reduction. Tt is not sufficie 
jag is postulated to operat 
in nonhumorous situations. 


While Berlyne (1960, 1969) has not di- 
rected his theory to developmental considera- 
tions, Tomkins (1962) reviewed the literature 
on the development of the smiling response in 
infancy, and interpreted the data discussed 
according to the operation of the mechanism 
of reduction in density of stimulation de- 
scribed above, He suggested that both the 
smiling response and its triggering mechanism 
are inherited, and he appears to maintain that 
the nature of the operation of this mechanism 
does not change as the child develops. How- 
ever, he concluded that learning and memory 
determine which stimuli initiate the mecha- 
nism. An inherited receptivity of the mecha- 
nism to particular stimuli (i.e., “releasers”) 
is rejected as very unlikely, 

Kagan (1967), in discussing the develop- 
ment of the smiling response to the human 


ema 
E a sch 
face, suggested that “the creation of à 


for an event is one major source P erepre- 
[p. 135]," where schema means t according 
sentation of an external pattern. t smiles 
to Kagan, when a 4-month-old infan o pas 
to a photograph of a regular wee uhis 
actually made a cognitive deve. able t° 
discovery occurs when the infant i amount 
match stimulus to schema. A moderar c c 
of mismatch between event and its furthe 
necessary as a source of pleasure. 

argued: 


adren 

ildre 
' | ight-year-old ch one 
Sixteen-weck-old infants and eight-y 


have 
smile spontaneously at events that pec match 2 
thing in common—the event is a par E of ree? 
an existing schema and an active Do 
nitory assimilation must occur [p. 137]. 


eX 
More recently, Kagan? advanced bo the 
that the critical factor in this [ae m as the 
gradual buildup and release of tens ‘ate 
child exerts mental effort to assimilat, js. 


l 
H : : “hus, t 
new situation into old schemas. Tl ‘incu! 


Aa ae qae : : o : 
similation is either immediate or “imitatio” 
that no effort is made toward as A No. 
there will be no buildup of tension 27 


jo 
vt i 

shif d 
clearly bears a similarity to the te 


Tol P 
by Berlyne (1960, 1969) and able ade 
(1962). Kagan's position is a valua r 


umo 
w 
pelo, 
The theoretical position advanced a 
Was developed on the assumption ro es" 
light of the complexity of the hi mor Faveo? 
it might be fruitful to consider che 


ton Stem 
? Kagan, J. On continuity: A demonstratione Ki 
infancy, Paper presented at the meetu-% O i an 
ciety for Research in Child Development, 
Monica, California, March 1969, 


i zb 
Developmental Theory of Cognitive H 


om 


sleastt® 1 


! 
I 
I 


DEVELOPMENT OF HUMOR 


m 3 
a a ieee types of humor separately. 
un 3E distinction in this respect may 

Upon the LUMBER humor based primarily 

(often NÉ an ios of some salient need 

and re to as “tendential” humor) 

expectations Él primarily on a violation of 
or Télationshine individual has come to hold 
his environm ips among different aspects of 

ÈS used her ent. The term “cognitive” humor 

assumed t e for the latter type, since it iS 

Primarily lat appreciation of such humor is 

satisfactio an intellectual phenomenon, with 

minimal] n of emotional needs contributing 
Y. This type of humor is typically 


exp . 

erien : 
“Clever eed as funny in the sense of being 
Milder | and is generally accompanied by 


Well} forms of affective expression. Thus, the 


kn | 
tr; Wn cartoon of a skier whose skis have 
acks on each side of a tree generates 


um 
or É $ 
Only because of the obvious impos- 


Sibi]; 
Dera of the feat in light of our prior ex- 
Sexua * with the stimulus elements depicted. 
itreley) aggressive, etc., elements appear to be 
q vant to the humor depicted here. ; 
that LS agrees with Kagan's (i 1967) view 
alread € assimilation of à stimulus into an 
Sour V established schema constitutes à 
Ce of pleasure in the infant, where some 


Opt: 4 
Ptimal moderate amount of mismatch be- 
ccurs. Berlyne 


Ween stimulus an 
s d schema O 

Eo 1967) has demonstrated that such 

ile stimuli lead to !n 
‘ome moderate increases 
Yne Ee value in their 0 
OE IS Taro e ra 

robably due to £ 

accompanying ihe infant's success ul nt 
seimilate the stimulus into. the re i : 
Chema, Since one of the earliest perceptual 


Sche infants is that of 
mas d by most !n EAE TTE 
the devel a used to exemplify the 


hum: 

Writer's, theoretical positi Kagan (1967) 
as demonstrated that the pleasure cns 
© derive from the operation of the mechanism 
“escribed above tends to e accompanied by 
Smiling in the infant. Sagan showed various 
Patterns reoresentative of human faces to 

© same group of infants at 4 $, and 13 

Months of age. While significantly more fre- 
Quent smi-ng 9€ urred to à photograph or 

Sculpture of à T gular face than to a regular 
Schematic face o. disarranged face at 4 


creases 1 

in arousal 
wn right (Ber- 
ce of pleasure 
] decrease 
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months, by 8 months of age equal amounts of 
smiling were shown to the regular and dis- 
torted faces. By 13 months of age, most smil- 
ing occurred to such alterations of the human 
form as a three-headed man and a mule's 
head on a man's body. The writer would agree 
with Kagan's explanation that these develop- 
mental changes in smiling reflect the ease with 
which the stimuli could be assimilated into the 


infant's facial or human-body schemas. Maxi- 
mal smiling to increasingly greater distortions 
easing age indicate 


of a regular face with incr 

that *making sense" out of a regular or only 
slightly distorted face becomes so easy that it 
generates little arousal, and consequently, 


little pleasure. 

The critical issue here lies in the determina- 
tion of the point at which the above source 
of pleasure develops into humor. While a 
smile or laugh may be evoked in the above 
situation, it does not become reflective of a 
humor experience until conceptual capacities 
have been acquired. Prior to the onset of con- 
ceptual thinking, the child experiences ex- 
pectancy violation (ie., à stimulus requiring 
some accommodation for successful assimila- 
tion) only in the sense of the stimulus being 
different or novel. The basic prerequisite for 
experiencing such expectancy violations as 
humorous, however, lies in the ability to per- 
ceive the stimulus not only as being different 
put also as inconsistent with the child’s prior 
experience with the stimulus situation. Thus, 
the pleasure in being able to incorporate new 
stimuli into existing structures becomes trans- 
formed into the pleasure of humor when the 
assimilatory process is accompanied by the 
realization that the stimulus situation is 
“backwards,” “wrong,” or “not like it should 
be.” A “reality-fantasy” dimension becomes 
important here, in that the child perceives 
expectancy violations as being funny only 
he has acquired a stable enough con- 


when 
ceptual grasp of “the real world” that he can 
assimilate the disconfirmed expectancy as 


being only a play on reality (see McGhee 
in press). Thus, the disarranged face or 
three-headed man in Kagan's (1967) ex- 
periments becomes funny only when the 
child realizes that what is depicted could 
not really occur, except at a fantasy level 
Prior to the achievement of such conceptual 
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mastery over the 
be made to incorporate the new stimulus into 
relevant schemas the child has developed; 
l occur. With 
higher levels of conceptual mastery, how. 
ever, "fantasy assimilation” occurs. While the 


S assimilated only 
in a fantasy or Pretend fashion, and it is the 


may generate in- 
or fear, but will not lead 
ver, as long as the 


The child’s ability to identify cues indicat- 
ing which mode of assimilation js appropriate 
Plays an important role here. That is, when 
a cartoon or joke is presented to the child, 
he knows that the elements depicted only 
occur at a fantasy level and that to success- 
fully understand the “point” of the joke, he 
Must understand the way in which reality 
factors have been violated. If the child does 
not perceive the cues indicating a potential 
humor situation, or perceives cues suggesting 
the opposite, he will attempt to assimilate the 
new material into his Cognitive structures in 
normal fashion and will fail to perceive any 
humor in the situation, 

While the conceptual child is able to see 
humor in the inconsistencies he identifies in 


: ject 
Jokes, cartoons, word pronunciations, b 
labels, etc., logic plays no part in the vt 
Thus, the child laughs at the mag ng 
mule’s head simply because it is Mr 
discrepant from his prior experience W othe 
stimulus elements involved. One p ie for 
children have difficulty explaining the! ji 
humor in such cases (along with mer anal 
language development) is that they ar store ° 
to go beyond reproducing a memana past. 
the way the situation has occurred in city £0" 
Only with the acquisition of the capa around | 
logical thinking (this should devel child | 
age 7, according to Piaget, 1950) is ‘cal rela- 
able to consider the violation of logic a new 
tionships among stimulus elements E stimu- 
source of humor. Thus, the same kumo | Jogi- 
lus may be laughed at by prelogical oie 
cal children, but the latter have Si with 
"higher level? of appreciation. l dos hun 
providing for a better understanding Td al 
situations that younger children non 
capable of responding to, the o d child 
logical thought capacity also allows Ea 
to comprehend more abstract sources taney 
mor. While for the younger child, ws type 
Violations must take the form of som 0 


: B tive 
of discrepancy of sensory input rela pas 


mite 


his previous experience, the child M in 
acquired logical thinking may find hc ok of 
the logica] implications of a Ung he 
elements that are perfectly consistent Nea the 
child's prior sensory experience. It jes 
acquisition of this capacity that le 
Child to the level of humor respons! 

found in adults 


the 
55 


EMPIRICAL STUDIES 


‘gre? 
Among those studies that have used i : 
as subjects, few have attempted delinea E 
velopmental changes in children's TT. 
humor stimuli. Even fewer have related or | 
findings to any type or theoretical framew 


Age Differences 


ing 
The most typical approach to study 3 
children’s humor has been to simply note E 
differences in children’s response to aum 
types of humor situations. In an early pes 
Walker and Washburn (1919) presented P €; 
of a picture to fourth grade, seventh gra to 
and college subjects and asked the subject 
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first c 
nus the picture with one of three 
fach alter and then rate the funniness of 
ternative ens completion. While one al- 
Situation on a normal or appropriate 
Situations, p, other two formed incongruous 
fourth rà For both types of completions, 
Seventh, aders gave higher mean ratings than 
ratings pens who in turn gave higher 
fest Several. college subjects. In an effort to 
2) ob theoretical views on humor; Justin 
and served the amount of laughter of 
followin girls at ages 3, 4, 5, and Ó in the 
feateq R types of situations: surprise OT de- 
; ie eara superiority and degrada- 
Sa Ros def and contrast, social smile 
Situations ulus, relief from strain, and play 
umor In general, she found an increase 
ease epee up to age 
as bomi the sixth year. V 
Producin to increase in its € 
age diffe laughter as a function of age, nO 
Umor pe were found for the other 
Ever pries observed. For all ages tested, 
Situation children showed more laughter in 
"Inally : in which they actually participated. 
Sentatio or 3-5-year-old children, verbal pre- 
Dict wom was found to be mor 


Oys 


ee of 2-5-year-ol 
n " School situation. 

Drehen. study dealing more w 

askeq sion than appreciation, 

à E pie in Grades 4-1 
g wt of 10 caricature-type C 

Cal important social, economic, 

Ero lens. Using a nationwide sample, t 
ari} that subjects in Grades 4-5 gave pri- 
ids either descriptive responses 
rad interpretations © the cartoons. ^ 

Co: 5 8, however, abstract interpretations 
nstituted the most frequent response. Wells 


1934) asked subjects in Grades 7, 9 and 12 
s of humorous 


ito four different type 
of pe in terms of preference for the type 
umor depicted. For the group as a whole, 
dm following order of preference (from 8 
die to least) was found: (a) absurdity, (b) 
apstick, (c) satire, (d) whimsy- While taste 
is absurdity decreased slightly preference 
cH satire and whimsy i 

ease between Grades 7 and 


e A 
IID 


with grade level was noted for slapstick. Wells 
noted that in the popular absurdity literature. 
it was the incongruity and unexpectedness of 
situations that subjects found amusing. Brum- 
baugh (1940) had trained observers note all 
the incidents of laughter in the classrooms 
of 1,600 Grade 3-6 boys and girls. She found 
that at all four grade levels, fantastic stories, 
such as fairy tales, elicited more amusement 


than any other type of story. 
i between 7 and 18 years 


e, describe the most comical 
y have seen, etc. Breaking his sub- 

(7-10, 11-13, 
d the humor of both younger 
groups to be predominantly visual; these sub- 
jects showed little appreciation of verbal wit. 
Incidents involving someone's discomfiture 
were given tly by the middle 
group. The adolescents of the oldest group 
were distinguished most by their (a) marked 
individual differences in sense of humor, (5) 
greater appreciation of verbal wit, and (c) 
tendency to reflect on why they found par- 
ticular situa 


tions funny. 
(1956), although not 


or, studied the 
of incongruity in young children. 
i o 7 years of age 
photograph of a three-sided 
ked to describe what 
aph was taken from 
le that, unlike the surrounding 
ded building appeared 
front wall—even though it 
d. They found little recogni- 
i f the apparent incongruity of this situa- 
While à rapid increase in 
ound between 53 and 63, 
little spontaneous recogni- 

age 7. Graham (1958) 
oons to first, third, and 


fifth graders in sets of four, and asked the 
subject to choose the funniest of the four. 
His data showed significant differences in the 

de levels. He suggested on the basis 


were shown 4 
and were a5 


three gra 7 La 
of his data that additional experience and/or 
maturation - -- increases the number of ob- 


situations which will be perceived 


jects or 
[p. 327].” Subjects at all three 


as humorous 


336 


PAUL E. MC GHEE 


opean 
and Oriental descent to tell what they thought 


about 15 pictures depicting absurd Situations, 


identification of the 


tuation, with no ac- 
companying effort to resolve the absur, 


levels and content “ 
They found à signi 
between grade levi 
the cartoons, Usin 
an index of humo; 


nd to increase steadily be. 
decreased Sig- 
Grades 4 and 5. McGhee 
; and 9-year-old boys 
representing a primarily 
type of humor (i.e., emotional 
humor were elimi- 


cartoons and jokes 
“cognitive”? 


three different types 
of humor tasks, requiring the subject to (a) 
arrange sequential cartoons in the funniest 
funniest pictorial com. 
pletion of a cartoon, or (c) choose the fun- 
Older subjects demonstrated 
significantly greater comprehension and more 
frequent "correct? choices than younger sub- 
jects for all three humor tasks. Older subjects 
also showed a Significantly greater number of 
interpretive humor explanations for the latter 
two humor categories, and greater humor ap- 
preciation for cartoons in the difficult caption- 
matching task. 

The wide variability in both age of sub- 
jects and operational measures used to col- 


Tits US 
lect humor data in these studies proi e 
from drawing any firm conclusions abo some 
differences in children's humor. There ildren 
Suggestion, however, that younger ie motor 
Prefer a more direct kind of visual ue 
involvement with potential humor T. ir dé 
While older children are able to use ka an 
veloping thought capacities to Eu 
appreciate more abstract sources o ore ad 
While the greater knowledge and m hildren 
vanced cognitive capacities of older C 


u- 
c :on of hi 
do provide for greater comprehension he 
p Iz) 

mor stimuli, 


they do not necessarily west 
child to perceive them as funnier. Come ap- 
of the humor stimulus would certam t 
pear to be a factor here and is discus 

a later section. 


Preschool C. hildren’s Humor 


z -en’s 
All of the studies of preschool caig 
humor known to the writer were mm ob- 
over three decades ago and used ae 
servational techniques, typically in a d nine 
school setting. Jones (1926) observe ge i 
children between 16 and 36 months of quent 
a nursery home, She found the most cane 
laughter-evoking situations for this gro ng 
be tickling, playfully dressing, gentle ba sing: 
romping with other children, and atl 
She also observed that the same stimuli ryin 
arouse laughter on one occasion, but eas 
on another. Enders (1927) observed 2- chool 
old children's laughter in a nursery ue 
during free play periods and upon preme 
tion of toys, pictures, false faces, and in 
involving vigorous arm and leg movem cing 
The situations Most effective in produ in 
laughter were those involving rat 
sound and/or movement. Gregg obse an 
22 nursery school children for 40 hours atly 
found that laughter occurred most Sae gn as 
in connection With some type of physica hter 
tivity. Washburn (1929) studied the laug th 
a group of infants monthly dee no 
first and twelfth month of age and foun o 
significant relationship between amount tal 
laughter and chronological age, developmi t 
evel, or a height-weight index. Buhler (193 at 
also studied infants? laughter. She found thé 


in 
* Gregg, A. An observational study of humor 7 


+ ja 
three year olds. Unpublished master’s thesis, Columb 
University 1928, 


| 
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laughing 
is mud begins by the second month 
ance or en evoked by another person’s 
Ween the a up to the fifth month. Be- 
Shows ą pr th and ninth month, the infant 
ng QS elie of laughter in the follow- 
( months) u accompanying play movements 
months) ipon receiving bodily stimulation 
Upon hearing when offered toys (7 months), 
N posture ese (8 months), upon changes 
( qonths) months), and upon awakening 
en s 
chi pasting (1931) recorded 2-4-year-old 
Schoo] Spontaneous laughter in a nursery 
MET cue and upon presentation of toys 
Showed TEASER situation. Few children 
8; Nr en upon presentation of the 
ong CUtiosit, r, most children showed interest 
luce] y. The two situations found to 
Children aughter most frequently in these 
Vere so „Were those that (a) they realized 
cially unacceptable and (b) involved 


odi] ; 
Ser Y motion, Ding and Jersild (1932) ob- 


ved d. 0 
4 ne smiling and laughter of fifty-nine 
-old Chinese children during free play 


and 
tte, mal activities in a combined kinder- 
b vem nursery school. They found that 
Wently ling and laughter occurred most fre- 
tivity in association with general motor ac- 
reso children who smiled most often 
ey s the greatest amount of laughter. 
[0A Suggested that laughing and smiling 
Orms si motor outlet in response to many 
Tack, of stimulation in young children. 
ett (1933) observed twenty-nine 18- 


"m 
ij. Onth-old children in a nursery school 
routine adult- 


ee 
ded situation and during rou: 

te activities. She found à nigh degree 
Wo t sistency in children's laughter in the 
igh Ypes of situations. Also, subjects ranking 
One in laughter in this study ranked highest 
e i later. Using the same set of subjects, 
E (1934) found positive relationships 
ogie frequency of laughter and chrono- 
hej ^" age (.44), mental age (.26); and a 

oe index (.89). 
Race data from these stu 
voly to the importance of direc 
chil d in the laughter of very yo 
eee The most typical occurrences of 
be ghter, especially in free play situations, 2D- 

ar to accompany engagement in some on- 
Boing activity and to reflect the child's en- 


dies consistently 
t motor in- 
ung 


P 1 


joyment in exercising the motor skills he has 
developed. While this laughter may reflect a 
basis for humor in the degree of mastery of 
various motor skills relative to other children, 
in the fashion suggested by some psycho- 
analytic theorists, it is equally likely that 
such instances of laughter are not indicative 
of humor in the sense of something being 
funny. The laughter may simply be reflective 


of pleasure derived i 


from the incorporation of 
a wide range of environmental stimuli or tasks 
into the motor 


skills developed. 


Cognitive Development and Humor 


demonstrating age differ- 


prehension or appreciation 
changes in the 


While studies 
ences in humor com 


do shed valuable light upon 
humor process a5 the child develops, age in 


itself is not an explanatory variable. Con- 
sequently, a number of investigators have 
gone beyond a simple description of age dif- 
ferences, attempting to account for such dif- 
ferences in terms of differential level of cog- 
nitive development. 

Bird (1925) used elementary school sub- 
jects and found a correlation of .89 between 


IQ and success in & humor test based on 
absurdity situati (1927) and Ding 
and Jersild find any sig- 
nificant relationship between measured IQ and 

i 2—5-year- 


] setting. "Kenderdine 


laughter and 
100 possibilities for laughter, à group with a 


neat TO. of 141 averaged 13 instances of 

sed to 4.7 for a group with 
n (1932) found cor- 
number of laughter 
situation to 
j age 3 to 12 at age 6. Wil- 
liams 1946) had 11- and 12-year-olds rank 
cartoons m most to least funny. 


Defining I 
agreement 


humor appreciation in terms of de- 
gree of ag with a standard order ob- 
tained from adults, he found greater apprecia- 

ith more intelligent children. Overlade * 


tion with mor t 
found à significant positive relationship be- 


s Overlade, T. Humor appreciation as abs! 
'. a as abstract 
ability- Unpublished doctoral dissertation, Purdue 


University; 1954. 
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tween comprehension of jokes and American 
Council i 
to discern embedded figures. 


he data obtained in these studies are dif- 
to interpret both because of the rela- 


humor items 
pled. If 4 


tremely simple or very complex 
group in question, it is unlikely that either 
i appreciation would correlate 


The role of IQ in each of 
these variables Should be manifest most 
clearly in cartoons or Jokes at à moderate 


appreciation of it. 
Zigl 


degree of comprehension, 
>» it should be difficult to predict 


- MCGHEE 


siye ones | 
sions other than these purely cope ee in 
will generally also play an mpor an themes 
determining perceived funniness Eg short, 
of aggression, superiority, etc.). only one 
given that we have information ae d very 
of these variables, we have ie humor 
little about the nature of the child 1 
sponse. 2 t 8^ 
"la a more recent study, ee be- 
(1966a) did investigate the plato 
tween humor comprehension and pou 5 they 
Using subjects in Grades 2, 3, 4, pares 
defined cognitive development in lexity 
age level, assuming level of comp ressiv! 
cognitive functioning to increase ate ae sug 
with age. As noted earlier, their fin petwee? 
gested a curvilinear relationship sion co" 
these two variables. While comprehen tion i 
Sistently increased with age, prts e 
creased through Grade 4, but addition 
clined significantly by Grade 5. In  showin£ 
within each grade level, subjects showe 
higher comprehension scores He ie an 
greater mean mirth response scores. 4 A con- 
his associates advanced a Masa. 
&ruency? principle to explain this dec artoon 
appreciation with age, arguing that “ja 
which make few cognitive demand, are Í 
lower mirth response than those tha 1d'5 
keeping with order 
Cognitive apparatus [p. 514].” In d Gould 
test this principle, Zigler, Levine, an 
1967 eve 
i et Was made to control for dt 
of cognitive demands made on the su irtoo 
a given humor stimulus. Groups of c el for 
Were matched in terms or difficulty leve t 
each of three grade levels (3, 5, and e 
is, on the basis of performance of a 


0 
ely 


ence scores as m 
data showed a me 
for the Moderately difficult cartoons "aded 
three grades. Zigler et al. (1967) conclu 


P 


l 


at the 
COngruenes findings support the cognitive 
à e Eu principle, since 
anit 

GN EVE r^ mirth response was found to 
the LE between th degree of cognitive congruence 
the į mor stils. cognitive demand features Of 
‘ iVidual i: io the cognitive resources of 
k Point B 

ne in pre by McGhee (1971a), how- 
ning at ing level of cognitive func- 
r et al. (1901 of comprehension scores, 
fe (2) syst 7) did not attempt to specify 
fe Surces po ematic differences in cognitive 
Te Ssessed by subjects obtaining dif- 


Coops. COM 1 
-  Bniti prehension scores or (5) how the 


ly, 

E ; emand features vary among their 
teg, he ifficulty levels. 
ea s (1971a) used a Piagetian theo- 
mme t ee in an effort. LOL MGR 
pive d imde deficiencies. The level of cog- 
M Was opment of 5-, 7-, and 9-year-old 
va [our ae in terms of per 1 
ER M ferent Piaget tasks. This study in- 
ation of the importance of degree of acqui- 
Bade Joye pPerational thinking (within each 
latio Vel) for the comprehension and appre- 
ptes o two distinct types © 
"mor ith a primarily cogni 
lego defined earlier). Th 
Umor es, labeled novelty a 
ME Cog differed most essentially in 
lice Ve processing hypothesized to be 

Ty for their compreh: 


Ui; 
l 
Ree eA : 
“iste ch category, the basi 


d incongruity 


ension. For 
s for humor con- 
biet violations of expectations that the 
E t has presumably come to hold regard- 
^ Ysical contextual composition of various 
tuati or behavioral events in the stimulus 
Vio Saone In novelty humor, the 
thi, O0 was manifest in the form of 50 
he * visually or physically discrepant from 
huy eübject's prior experience. In incongruity 


nor 1 : 
aboy > however, nothing was discrepant 
zation of elements 


do i the physical organi 

RD eq rupe d violation for cartoons 
Ore es in this category only occurred at a 

iUo. abstract level upon assimilation of in- 


expectancy 
me- 


Tr N 
Euren provided by different cues in the 
tha Ulus. McGhee was testing the hypothesis 
transition 


a 
E 7-year-olds (who are in 
n preoperational and operational modes 
Unctioning), degree of acquisition of logi- 
thinking is positively related to the ca- 
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the basis for humor in in- 
For the more simple novelty 
othesized that logical think- 
ary for comprehen- 
lations constituting 


pacity to identify 
congruity stimuli. 
humor, it was hyp 
ing should not be necess 
sion, since expectancy vio 
the basis for humor occur at a visual level. 
No significant relationships were predicted for 
5- and 9-year-olds, since most 5-year-olds 
have not yet begun to acquire operational 
thinking, while most 9-year-0 
in this development. 
The data were cl 


hypothesis. For the tw 
humor in the 7-year-old group, degree of 


acquisition of operational thinking was not 
significantly related to humor comprehension. 
Significant positive findings were obtained, 
however, for all three classes of incongruity 

itively advanced 7-year- 


humor. More cogn! 
olds also gave significantly more frequent in- 


terpretive (as opposed to descriptive) ex- 
planations of the basis for humor for incon- 
gruity humor, but not for novelty humor. As 
hypothesized, none of these relationships was 
significant at the 5- and 9-year age levels. 
Analyses failed to show any significant linear 
or curvilinear relationship at any age level 
between cognitive development and humor ap- 

d in terms of the subjects’ 


preciation (define 
ratings on a 1-5 funniness scale). McGhee 


concluded that no support was obtained for 
the cognitive congruency principle. 

McGhee (1971b) used the same subjects 
used in the previous study (ages 5, 7, 
and 9) in a different approach to studying 
the role of operational structures in deter- 
mining comprehension and appreciation of 
humor. Three different sets of humor stimuli, 
containing varying degrees of emphasis on 
tendential versus cognitive bases for humor, 
were used: (a) sequential cartoons, in which 
the subject must arrange the parts in the 
funniest order (Set I); (4) cartoons in which 
some part is missing, and the subject must 
pick one of three alternative completions that 
makes the cartoon the funniest (Set IT); (c) 
cartoons for which the subject must choose 
the funniest of three possible captions (Set 

jr Significant positive relationships were 

n degree of acquisition of oper- 
ng and humor comprehension 


early supportive of this 
o classes of novelty 


lds are well along ` 


$40 PAUL E. 


Set IIT with 9-year-olds. While more cogni- 
tively advanced subjects consistently made 
more frequent “correct” orderings or choices 
at all age levels, this relationship only reached 
significance for Sets I and II at the 7- and 
9-year level. For the age 7 group, a significant 
relationship was found between level of oper- 
ational thinking and the tendency to give 
interpretive humor explanations for Sets T 
and IIT. Again, no significant linear or curvi- 
linear relationships were found between humor 
appreciation and level of cognitive develop- 


: While no significant 
n humor compre- 


resented in this sec- 
cognitively advanced 
demonstrate better 


- It is unclear at this point how a 
child's understanding of a joke, cartoon, or 
real-life situation corresponds to how much 
he laughs at it or how funny he Says it is, 


While earlier Studies tended not to consider 
the distinction between 


in choice of stimulus 
at they be "appropriate? 
St cases, this has meant 
imination of ove: 


materials has been th 
for children. Tn mo: 
the minimizing or el 
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sideration has in many cases been paradox 
cally overlooked. . in this 
Although all of the studies reviewed e cog- 
section investigated the role of level ihe ap- 
nitive functioning in children’s humor, € 
proaches to defining this variable have h, 0 
considerably. The simplest appris 
course, is to merely use age and imu pro- 
level of cognitive development a= b 
gressively with age. However, since wi r^ i 
individual differences accompany --—-5- 
opmental phenomena, if we are to d omnitive 
a child's performance to a given mension 
dimension, some measure of that dime 
Should be obtained. 1ceptu- 
The most typical approach to emi beer 
alizing level of cognitive functioning n did 
in terms of various IQ or related meh nd- 
While the relatively consistent posve 
ings using this approach with ehid m 
Support its usefulness, the use of IQ jsitio? 
alone does not clarify the role of kc 
of particular types of cognitive ski o dif- 
understanding, appreciating, or generating h 
ferent types of humor. Furthermore, Me 
(1971a) found that the degree of hr 
sition of concrete operational thinking 
7-year-old children accounted for wal 5 of 
that could not be accounted for in pe pe 
IQ scores, Thus, while measured IQ m im- 
tapping cognitive skills that are ve lopint 
portant in the humor process, the ai ive 
child is also acquiring other new cog yen 
Capacities that are not measured by Es role 
tional IQ tests, but play an important og 


j in tyP 
In the comprehension of at least certain tY 
of humor. 


Social Nature of Humor Situations 


gntet 
While it is generally assumed that met a 
and humor are primarily social pheno 2 


ing 
the amount of data on children gem est 
this issue is surprisingly limited. Most 0 des 


P ca 
studies were Completed nearly four dec; 


e 
280, and all were confined to children d ap. 
of age or Younger, usine the observation er? 
Proach in nur 


sery school settings. Engine 
(1927), Gregg (see Footnote 4), Kender ty 
(1931), Ding and Jersild (1932), Bra ugh 
(1933, 1934) all found more frequent "p 
ter in nursery-school-age children in cket! 
situations than when alone. While Bra 
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(193 
E p frequent laughter in the pres- 
E usu. well as children, Enders 
TéSence of children seldom laughed in the 
More laught adults. Ding and Jersild found 
Close fien > and smiling in the company of 
Strangers E S, brothers, or sisters than with 
E rocket CES 1934) found that 
*xhibiteq Tp high in amount of laughter 
€ grou ended to prefer other members of 
most idis showing the same behavior. Also, 
their is ren were found to confine most of 
o gode to a few particular children. , 
âttempt h writer’s knowledge, no systematic 
te of ee been made to determine the na- 
Yoke | ose stimulus situations that tend to 
alent; aughter when the child is alone. 
the m (1942), however, after reviewing 
tions a and making personal observa- 
Weasio his own, did discover number of 
i ns for children’s laughter in nonsocial 


INS 
or sae These included sight of @ bright 
Surprise ing object, jogging, mild shock or 
nition, a repetition, the incongruous, recog- 
Active accomplishment of some new task or 
Colnciqo incongruity in words oT ideas, and 
dental events. 


} inet the available data do suggest the 
Ten's ance of social factors in much of chil- 
usi laughter, they do not foster the con- 
Dhenon that laughter is primarily à social 
fac E enon, to the exclusion of nonsocial 
S very Since the nursery schoo ) 

Y conducive to social interaction in 


ral ; 
al, it follows that a large P 


- 
an, 
soeter occurring in this sett i 
okina nature. Similarly, since the stimulus 
E ot 5 the laughter is likely to be shared 
Aeg, TS with whom the child may be inter- 
Dresen the laughing child is likely to be 
ate a in a situation where other cl 
Sit, 0 laughing. This type of experimental 
“on it therefore, does not provide suitable 
“Yoki ions for determining the nature of the 
X M conditions or the relative importance 
Rhter in nonsocial settings: 
important distinction ™ 
ere between laughter as 


st be made 
an index of 
f heightened 
laugh 


P Se avoid giving the appearance of not 
um anding or to maintain social desir- 
- Thus, the critical issue is (a) whether 


smile or laugh necessarily reflects a 
humor experience, and (b) whether a social 
environment—with and/or without social 
laughter—is capable of either creating humor 
out of a previously nonhumorous situation or 
boosting the perceived funniness of an already 
humorous situation. 


the 


Language and Children’s Humor 


While language would appear to be central 
in much of children’s humor, there has been 
little systematic investigation of the role it 
does play and how this changes with age. 
However, a few writers have theorized about 
its importance on the basis of observation 
and anecdotal evidence. Bever (1968) found 
that 5-year-olds laughed more at jokes in- 
volving ambiguities in grammatical surface 
structure (e.g., “Why can’t you starve in 
the desert?” “Because of the sand-witches 
there”), while 10-year-olds laughed more at 
jokes involving ambiguities in deep structure 
(e.g, “Why can you jump higher than the 
Empire State Building?” “Because it can’t 
jump at all”). Gesell and Ilg (1946) argued 
that by the third year the verbal play of re- 
peating à word (like *golly") with someone 
becomes humorous. By the fourth year, he 
begins to enjoy silly rhyming (e.g. “mitsy, 

ds. Wolfenstein 


witsy, bitsy”) and play on wor 
(1954) suggested that the earliest form of 


the young child’s word play centers around 
the ambiguity of his own sexual identity. 
Generally by the age of 3, the child has begun 
to master any doubts about his own sex and 
can accept “you're à boy (or girl)” as a 
joke. But prior to this, there is a tendency 
to take the statement seriously and be threat- 
sexual identity has not yet been 
plished. The mastery of individual 
nti ts the stage for the next level of 
verbal humor about the fourth year. Again, 
d by the wrong name may be dis- 

Jor to this mastery, but funny there- 
s in common names generally 
ater than changes in 
es, with the latter continuing to 
up to adolescence. While 
the first 5 or 6 years 
and of the child's own 
ncrete verbal 


in frequency 
- humor of 


be spontaneous 
on, the child relies on a co 
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formula between about 6 and 11. Children 
memorize jokes in this period, seldom making 
up their own. Grotjahn ( 1957) advan 
view similar to that 
arguing that children 
in jokes when they 
guage characteristics utilized by the joke. 


Some competen 
Modification o 


may be funny if initiated by the child himself 
but disconcerting if 


range of events encompassed by the new con- 
cept, he encounters a conflict of whether to 
take the new information seriously or as a 
Joke. As he gains a more firm conceptual 
grasp of the concept in question, any devi- 
ation is more likely to be placed in the joke 
category. An important factor here may lie 
in the child’s growing capacity to identify 
cues given by the speaker that indicate 
whether the inconsistency (with the child’s 


Br 
Past experience) is put forth in a serious 
joking manner. 4 ques- 

A central issue here again lies in e. 
tion of when laughter as a simple rige e 
tion of pleasure becomes indicative ions 0 
experience of humor, for early altera ce on 
word sounds may lead to pleasure 50 «ord 
the basis of Successful assimilation pe. jt 
Schemas" in the fashion described ear ds of 
may be that alterations of word wá after 
meanings become a basis for humor on tind | 
Conceptual thinking has developed, n" icted* 
the child to realize the inconsistency dep 


Sex Differences 


an 
While a number of studies using boy e 
girls as subjects have failed to — that 
presence of sex differences, those er 
have Systematically considered sex oe oil 
have yielded inconsistent data. ter aP 
(1928) observed that girls show a iud age? 
preciation of the ludicrous at earlie m 
while boys show more at later ages. rig 
baugh ? noted that boys produce more x in 
humorous drawings than girls, althoug£ great 
vidual differences within each sex were 2%, 


tona 
enough to make the distinction questi agre! 
Justin (1932) f 


Subjected 3-6-year-old € r-pr 
to a number of different types of en sex 
voking Situations, but found no mar nda 
differences, However, girls were fou ed t 
smile more than boys, while boys ten" ve 
laugh more than girls. Boys also showe iliam? 
variability in their responsiveness. V 
(1946) found no sex differences in mon 
appreciation of jokes and cartoons howel 
l1- and 12-year-olds, put boys again ile” 
more variability in appreciation scores- d a” 
et al. (19662, 1966b, 1967) failed to fin nsio" 
significant sex differences in comprehen ef 
or appreciation of cartoons among 
normal or retarded subjects. agai? 
Although the data in this area are 3 
relatively meager, the solid experiment, se t 
proach of the Zigler et al. studies |ui 
that through the ninth or tenth year, at pio 
the different Social environment in si£ 
boys and girls mature does not lead to de 
nificant differences in their ability to UP j 
^ Brumbaugh, F, Stimuli which cause laught@ ew 


children, Unpublished doctora] dissertation, 
York University, 1939. 


inal 


degree 


T 


" 
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Stand ; / 
A uer appreciate cartoons. Justin’s data, 
she Du partially conflicting here, since 

their a oys to be more overtly expressive 
act ae sp Whether this does in 
Served” hum the development of a more “re- 
males mor responsiveness in females than 
"esearch can only be determined by further 

on this issue. 


Hum Eo 
0r in Exceptional Children 


To t f 
'sheq 1 reviewer's knowledge; only two pub- 
umor pus have attempted to study the 
of physically or intellectually handi- 


“appe i 

(pm Children. In light of the theoretical 

ort or is the resu 

"DN d, 1916) 
wishes, 


an 

ticinay Ped children, whose 4 

limiteg (7 physical activities 

or ^d, might satisfy their SUPP‘ 2 
Physical activity through the expression 


of h 
shy "Bor. Thus, she investigated the tror 
etween isabilities 27 
motor disab ing activity 


ings gi 
given to cartoons portraying ii 
lit Donactivity by 10-12-year-old poliomye- 
S, cereb aal children. The 
boli ral palsy, and norm? 5; toons 
omyelitis children rated activity Pies 
rp enificantly more funn Sa RD 
Ho O26, and also gave hi 
d cartoons than did cerebra 
o, al children. Thus, suppor 
P 'cbosnalytie theory- fain 
gler et al. (1966b) comp € i 
"termine edes the restricted experiences 
A /or unique personality dynamics 
4j3'ded influenced the dh 
E They attempted tO cq 
aj Enitive level by using panes 
Se retardates, à erent ge. Twenty 
Bes matched for 10 females) in each 
abjects (10 males = cartoons. Normal 
gory were show ificantly 


ub n reater com- 
E pets showe „pression of affect than 
*hension and 9 

A n tari 


“ther iin ded gom d ona 
i group a differ from m other 
ud go GP. nsion. Among all subjects, 
Only" en onaljzed metardates group 
d the b- increase in smiling and 
Tm I s ef increased. comprehension. It 

ghter wi 4 from this finding that the 


Was conclu 


institutionalized retardate subjects tended to 
show more "inappropriate" affect. Among the 
institutionalized retardate subjects, length of 
institutionalization was significantly nega- 
tively related to humor comprehension, but 
was not related to appreciation. Within all 
three groups, comprehension was significantly 
positively related to mental age. 


DEPENDENT VARIABLE IN RESEARCH ON 
CHILDREN’S HUMOR 


Along with the insufficient amount of at- 
evoted to the nature of the stimulus 
situation in humor research, many investiga- 
tors have also failed to give serious consider- 
ation to the dependent variables used and to 
factors that might influence performance on 
those measures. One of the earliest methods 
used to study young children's humor was to 
observe spontaneous smiling and laughing in 
a natural environment. However, the value of 
the data obtained using this method clearly 
depends on the ability of the observer to 
determine the true laughter-evoking stimulus. 
While the observer may define the source of 
laughter in terms of his perception of the 
situation (and have perfect agreement with 
an independent observer), the child may have 
been responding to a different set of cues. 
This possibility is especially strong in a nat- 
ural environment, where the total stimulus 
situation contributing to the humor experi- 
ence is very complex. Even if we may assume 
perfect accuracy in the observer's identifica- 
tion of the stimulus elements to which the 
subject was responding, we have no more 
insight into what the child laughed at, or 
why, than is achieved upon the observation 
of laughter when he sees a cartoon or hears 
a joke. Thus, as with other approaches to 
studying children's humor, additional infor- 
mation must be obtained from the child him- 
self about what he was laughing at and why 
he found it to be funny. However, since it is 
difficult to obtain this information from very 
young children, our ability to gain a com- 
plete understanding of the nature of the youn 

child's humor response is more restricted. By 
using the observational technique as a pre- 
liminary step to suggest stimulus dimensions 
that might consequently be studied experi- 


tention d 
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mentally, this restriction might be partially 
overcome, 

A similar set of problems confronts investi- 
gations using cartoons or jokes as humor 
stimuli, With respect to humor appreciation, 
the most important distinction t 
Concerns the choice of an affective or intellec- 


; Who 
investigators have relied on 
ing and 
ating to 


and therefore indicative of 


greater appreci. 
ation. While this 


assumption is probably a 
; Widespread individua] 


more likely possibility, however, 
the child does perceive some stimuli 
more humorous than ot 
Spond accordingly. 
true for children sh 


even the young, 
known that car 


child ut of P 
Thus, if a young child is taken o with 
classroom and placed in a room 


mig ht 


pild’s 

have produced great laughter in ie nigh 

classroom or at home. Similarly, anxiety laugl" 
lead to the Consistent “Tniappropnar n e 

ter noted above, In spite of our € js diff 

Control or ease the child’s anxiety, it nxiet 

cult to determine the extent to which i iid 
be boosting or toning down the 


play 
adequacy of 

an index of a c wht 
gested elsewhere (McGhee, 1971a) tha má 
different lypes o 

be rated by a 
some ma 
others, 
salient 


Cartoons or er nny: 
Subject as being equally 
Y be more conducive to laughte pe 
Specifically, humor related tO 
emotiona] tendency or nee 
Sexual or aggressive humor) may 


tha 
H ct 

conducive to overt expressions of a (e 
more purely Cognitive types of humor 7e 


mes * 
in which such them? sen 


lus. 


tinge 
An approach using children’s gee Se of 
of funniness also encounters the prol ow" 
evoking socially desirable Seg ae 
ever, this would appear to be reduce ter 
what by the fact that the experimen rati 
given the Subject a scale that allows à jike! 
of “not funny at all.” He may be Jess igh 
in this Situation, then, to give X because | 
rating simply “because it’s a joke” or 5 rate 
he thinks the experimenter wants him 
it highly. 1 rating 
A problem characterizing the verba ffectiv? 
Approach, which is not shared by thea of th? 
approach, lies in the accurate usage 
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MG 
ed scale provided. If, for example, a 
85 to th Scale is utilized, the question, arises 
accurat minimal age at which the child can 
toon ed identify his feelings about a car- 
Scale jee and transfer them to a numerical 
of the younger child may find differentiation 
task "i of the scale a relatively simple 
differenti have difficulty making the finer 
etwee lations required by the scale points in 
à dich n. Clearly, at some point we must use 
Wheth otomous scale and simply ask the child 
It E the stimulus figure is funny or not. 
ls act nay be that each of these approaches 
e ae tapping very different aspects of 
ectiy ild’s global humor response. Thus, af- 
Social measures may be greatly influenced by 
influe variables, while intellectual ratings are 
and need to some lesser extent. On the other 
re aa the latter measure may more accurately 
ni the child's intellectual reaction than 
qe former. While the Zigler et al. 
se a) data on the relationship between 
Eno measures is valuable, additional re- 
ate efforts comparing the two approaches 
ie itd, It may prove more fruitful to 
Sco ke predictions on the basis of a combined 
re than on either measure separately. 

Measures of humor comprehension have 
ti €n less consistent across different investiga- 
Ons, While most studies do not report the 
sig cedures used for scoring for comprehen- 
eM it appears that comprehension has typl- 
a Y been measured in terms of the degree of 
Mi roment of the subject’s explanation with 
of Ess the experimenter defines as the “point 
Es joke or cartoon. This approach is 
ent kened by the fact that criteria for di ox 
is levels of comprehension must be estab- 
ote separately for each cartoon or joke. It 
Ould seem valuable to work toward the de- 
velopment of a general basis for scoring com- 
Prehension which might be applied to any set 
Of humor stimuli. The studies of McGhee 
(19712, 1971b) are based on such a general 
Scoring system, although his system was de- 
Signed for stimuli with a primarily cognitive 
(as defined earlier) basis for humor. A de- 
tailed description of this scoring system is 


Presented in McGhee.’ 

p. E. Cognitive development and chil- 
ehension of humor. Unpublished doc- 
Ohio State University, 1968. 


be 


* McGhee; 
dren's compre 
tora] dissertatio? 


UNRESOLVED THEORETICAL AND EMPIRICAL 
ISSUES 


The most striking conclusion that may be 
drawn from this review is that theory has 
played a minimal role in generating those 
studies that have been completed, making it 
difficult to achieve any coherent integration 
of the data obtained. While classical psycho- 
analytic theory has generated a number of 
studies of humor in adults (see Gollob & 
Levine, 1967; Treadwell, 1967), only the 
Hetherington (1964) study has been com- 
pleted with children. Arousal theories and 
classical gestalt theory also remain to be 
tested using children. Because of the prior 
inadequate attention given to theoretical is- 
measurement problems, and stimulus, 
subject variables in many pre- 
derstanding of children’s 
d very little. While the 
need for more theory-generated research is 
apparent, this may follow as a matter of 
course if more specifically developmental 
theories of humor are advanced. 

Developmental theories of humor must be 
careful to distinguish between à theory of 
humor and a theory of laughter. While much 
of the data on very young children reviewed 
here is based on laughter, it is difficult to 
determine whether the laughter is indeed in- 
dicative of a humor experience. Since it seems 
clear that humor constitutes only one among 
several sources of laughter (see Flugel, 1954; 
Giles & Oxford, 1970), theorists must care- 
fully define their criteria for a humor re- 
sponse. Specific issues that a developmental 
theory of humor must consider include (a) 
the onset of the capacity to experience humor, 
including organismic and environmental vari- 
ables that influence its timing; (b) the nature 
of developmental changes in the humor re- 
sponse; (c) the ontogenetic relationship be- 
tween smiling and laughter as a sign of 
pleasure in assimilation, body stimulation, 

and an index of a humor experience; 


sues, 
response, and 
vious studies, our un 
humor has progresse 


etc., 
(d) the relative predominance of emotional 
or need-related versus more purely cognitive 


pases for humor, both initially and at sub- 
sequent developmental levels; (e) the role 
of cognitive, personality, and social variables 
jn this developmental sequence; (f) the role 
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of language acquisition in early humor; (g) 
the relationship between conceptual thinking 
and the capacity to experience humor; and 
(4) the role of intelligence and the acqui- 
sition of logical thinking in the child's ability 
to comprehend, appreciate, and generate 
humor. 

Before theories of humor may be satis- 
factorily tested, a number of basic methodo- 
logical issues must be settled. These were dis- 
cussed in the preceding section. Once we have 
achieved a better understanding of the inde- 
pendent and dependent variables used in re- 
search on humor, the Study of children's 
humor might Progress more rapidly upon the 
development of a standardized test (or tests), 
which might be available to all experimenters, 
This would maximize comparability of data 
and provide for a much greater integration 
of findings than is presently possible. Along 
with a sampling of varying content areas, 
types of humor, and modes of representation 
(cartoon, joke, etc.), this test should control 
for the level of cognitive demands made upon 
the subject for comprehension of the humor 
depicted, so that stimulus complexity might 
be adequately sampled. 

The nature and role of ari 


in humor remains untested, 
of Berlyne 


Sion, an investigation of the 


role of such 
factors as complexity, 


Surprisingness, and de- 
gree of novelty or incongruity is needed, 


These factors, in combination with accom- 
Panying influences upon arousal, may play an 
Important role in determining the effect of 


stimulus repetition on the child's humor re- 


remains un- 
degree of comprehen- 


€ given little attention to 
Co ee antay an ‘the. humor process. 
Many situations (e.g. a man being run over 
by a steam roller in a cartoon) appear to be 
humorous only because the child perceives 
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cues that tell him that the event in quem 
is only a play on reality; that is, exists one 
at a fantasy level. In such situations, ya 
of these cues may also remove the hum Y 
Thus, it should be fruitful to study the e 
of fantasy in children's humor at differe 
developmental levels. 

The creative aspects of children's huy 
have also received little empirical ame. 
Most studies have simply recorded the pe: 
Ject’s reactions to prearranged stimuli, e 
have not allowed the subject to ge 
humor on his Own. Important areas for the 
search along this dimension include (4) ted 
age of onset of the capacity for sélf-genersc a 
humor; (b) the types of humor cone 
initially and at later developmental sta8 in 
(c) the role of cognitive development ted 
initial and later phases of self-gener 
humor; (d) the relationship between 50 0 
and Personality variables and frequency of 
initiation of humor; and (e) the Lue 
measuring instruments for determining 
capacity or tendency to create humor. ial 

The influence of a social versus nonse. 
environment "pon children's humor also ach 
mains to be investigated. A valuable apP PS 
here might be to provide children with a y in 
array of potential laughter-evoking stimu Jay 
à nonsocial Setting (eg. alone in ‘oo 
room) and record the child's laughter OF ^, 
bal rating in response to them. The influe 2 
of social facilitation upon measures of po. 
ciation might then be determined by 1” ets 
ducing the same set of stimuli under two a 
of social conditions; namely, children pi h 
and either interacting or not interacting jt 
the subject upon Presentation of the sum. 
The need to maintain social desirability ^, 
lest. situations also constitutes an import? 


. . . r me 

Potential social influence upon humor 

sures, and needs to be investigated. thet 
Finally, additional areas needing !U^, py 


ziet. 
investigation include (a) the effect of annie 
on humor responsiveness; (b) sex differen" 
(c) the influence or the sex or behavior 
the experimenter upon subjects’ laughte? id 


verbal ratings; (d) humor comprehension # n- 
appreciation in mentally retarded or emo" .., 
ally disturbed children; (e) the role y indi 
sonality and intellectual variables in ! nd 
vidual differen 


ces in humor preferences: 


a 


| 
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uD the relation between children’s humor 
Sponsiveness and “healthy” personality 
development. 
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THE BOGUS PIPELINE: 


A NEW PARADIGM F 


EDWARD E. JONES? 
Duke University 


If subjects can be convinced th 


measure both the amplitude and dire 
at the m. 


f the biases th 
do not want to 


rament attempts to predict wh 
e uncontaminated by many © 


AND 


at a physiological monitoring device 
ction of emotional response, 


achine says abo! 


OR MEASURING AFFECT AND ATTITUDE' 


HAROLD SIGALL 


University of Rochester 


is able to 
their sub- 
ut their attitudes should 
at obscure paper-and-pencil 
be second-guessed by a 


measures, This assumes that subjects 
ibed in which variations of this “bogus 


machine, Four experiments are descr 


pipeline" paradigm were em 
validate the device in t 
estimation measures of interperson 
differences between such es 
of attitude are based simply on à li 
preference, or on the fact t 
has been made salient. For W 
the bogus device appear much 
perimental settings where one 
feelings, for instance, toward Negi 


Ma 
With “NY Psychologists for many years, beset 
iated with in- 


erring t vexing difficulties associ 
ha s rue feelings from behavior, must have 
line co about discovering a direct pipe- 
Voulq the soul (or some nearby location). 
thei n it be nice if people really did wear 

earts on their sleeves? Or even if we 


inq; Access to reliable and valid physiological 
ang Ors of attraction and dislike, approach 
jd disbelief? But 


avoidance, or belief ar 
10 o (and some would say; thank goodness); 
p i € has yet achieved this last breakthrough 
Present path to invasion of Eu E 

pest i aradigm - 
decre ajor problems 


ars t 
«X EO: S of the m 
ypass some nd poses new 


alas 


Inferen: redet 

ce from behavior 

lems for substantive research along the 
:s based on the simple 


o be second- 
Jf a person could be 


Onvinced that we d? have à machine that 
i, Clsely measures attitudinal men and 
tensity, we assume ghat DE WONG oe mos 
ly :Q this paper was sup- 
med research, nc Foundation Grant GSee? 
to thie der and bY ae Institute of 
Menta Halk Grant MH17180- o the second 
üthor, 


a 
D The paradigm !5 


| emise that no one 
essed by a machine. 


wants t 


ints should be sent to Edward 
or TP" ot Psychology, Duke Univer- 


a Requests for oh 
v Jones, Depart Carolina 27706. 


s 
"ty, Durham, N° 


ployed. In € 
he subject's eyes an 
al attraction. 
sures and the standari 


timation mea: 
e detection principle, on à prin 


hat the affective 
hatever reason or reasons, 
more ready to express neg: 
might normally expec 

roes and the physically h: 


ach case it proved quite possible to 
d to extract theoretically meaningful 
It is not clear whether the 
d verbal measures 
ciple of error 
f a complex attitude 
subjects attached to 
ative affect in ex- 
t the inhibition of such 
andicapped. 


component o 


ct accurately what the machine 
The bogus pipeline para- 

the use of a device or 
tedly measures one's true 
or an issue, some 


tivated to predi 
is saying about him. 
digm involves, then, 
machine that purpor 


feelings about a person 
means of validating the machine in the sub- 


ject's eye, and finally the dependent variable 
itself: a prediction by the subject of the ma- 
chine's telltale reading. 


Before proceeding with a detailed discus- 


sion of this paradigm and its possibilities, it 
may be useful to place the procedure in the 
broader context of measurement in social psy- 
chology. Throughout much of the 1930s it 
could be reasonably argued that the measure- 
ment of attitudes was the central preoccupa- 
tion of social psychologists. There was little 
theory to speak of, only an occasional ex- 
periment to report, and the notion of the- 
oretically relevant situational manipulations 
was only beginning to take hold in the after- 
math of Lewin, Lippitt, and White's (1939) 
studies on autocratic and democratic leader- 
ship. With the development of this manipul 

tive experimental tradition during the ae 
and 1950s, the interest in attitude mea s 
ment was pushed from center stage. Es — d 
the Festingerians (cf. McGuire, pros y 4 
ished great care on the independent xs 
being manipulated, while the dependent m 
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ables used were almost a casual afterthought. 
Instead of exploring the antecedent conditions 
that affected some standardized, reliable, and 
socially important dependent variable, ex- 
perimenters in the Festinger tradition typi- 
cally asked simple, direct, tailor-made ques- 
tions to get at the effects of theoretically 
relevant situational variations. The conse- 
quences of elaborate experimental Scenarios 
were often measured by a check on an n- 
point scale from “like” to “dislike” or a de- 
cision to wait alone versus wait with some- 
body else, Standardization, multimethod tri- 
angulation (cf. Campbell & Fiske, 1959), and 
cross-situational generality were typically 
sacrificed in favor of contextual relevance — 
measures introduced in as unobtrusive, “natu- 
ral,” and fitting a manner as possible. 

Without in any way denying the importance 
of other research styles and strategies (eg., 
McGuire's, 1969, Hovlanders with their search 
for interesting antecedents of standard de- 
pendent variables), we think the emphasis on 
contextual relevance has been generally well 
placed. It is hard to quarrel with the notion 
that the subject should respond to a reason- 
able question that fits into the cover story 
defining the Purposes of the research, and 
that he not be dragged into a morass of com- 
plex and obtrusive psychometric measures. 
But there are obviously problems in assess- 
ing the answers that subjects give to any re- 
search question, and many of these problems 
are exacerbated by the one-shot direct probe. 


In the following section, we review some of 
these difficulties, 


Potential Dificulties with Rating-Scale 
Measures of “True Feelings” 


While many of the following difficulties 
apply to the general case of attitude measure- 
ment, we turn to the attraction area for our 
examples and consider ratings made by the 
subject of a target person. i 

1. Generosity effect. There is a common 
tendency for raters to give target persons 
the benefit of the doubt in recording judg- 
ments with obvious evaluative implications. 
On the kind of antonym scales often used 
in impression research, for example, it is rare 
to find subjects using the negative halves of 
scales like Eenerous-stingy, wise-foolish, or 
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gh to 
friendly-hostile. The same is true, p in 
a smaller extent, with Likert-type subject à 
volving statements with which the pi or dis- 
asked to express degree of aprecon stingy 
agreement (e.g., “She is rather on positi? 
Side"). This general tendency to give 5 
ratings is probably an expression k giving 
realization that it never pays to 5d n 
offense when it is easy not to. We ear 0 d 
that many subjects do not want to apP ositive 
or extreme and choose a moderately P 
rating as a “place to hide” on the S ier con- 
2. Evaluation apprehension. This : " erest 
sideration merges into a more garea exper” 
of the subject to be evaluated by B nd 
menter as normal, mature, percep hensio? 
full of integrity. “Evaluation appre ( 965) 
has been highlighted by Bui Hoo? 
and by Sigall, Aronson, and Van fact! 
(1970) as a potential source of iar, e 0 
error in measuring the dependent ka 57) 
an experiment, Similarly, Edwards k ed 
and Crowne and Marlowe (1964) iai 
that the social desirability of a respon end" 
affect whether or not it is emitted. he situ? 
ing on the subjects interpretation of pe ead 
tion, his desires to be well evaluated n ture he 
him to play it safe, to show how enm 0 
is by not responding with a negative reps 
a critical attack, or to show how pe smal 
he is by letting his ratings refle ior. Jp 
nuances in the target person’s behar anaidi 
short, the subject’s evaluation is a ect? 
may have diverse and context-speciB will t? 
on his ratings. In general, the subject a ra 
to respond as he thinks a mature 
tional person would. lated ' 
3. Experimenter demand. Closely Te í the 
the notion of evaluation apprehens i. 
tendency that Orne (1962) and tty 
(1962) have posited for subjects to cial ef 
please the experimenter or make a spe pis 
fort to confirm what they perceive to den? 
hypothesis. To the extent that this E be 
is operative, it may obviously affect = 
ject’s ratings of the target person—W 
Without his conscious realization. with 
4. Thoughtlessness, A major difficulty pat 
Paper-and-pencil measures of all sorts wt ed 
it is too easy not to care, not to be conc” js 
about the validity of one’s answers. The oy 
no clear penalty for casual sloppiness; 


of 
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REM only the extremes or only the neutral 
is ; Tather than taking the trouble to be 

cerning, 
Erie of “psychologic.” One of the 
alo eff, y noted rating errors is that of the 
erent LE tendency for ratings on dif- 
Telated imensions to be more highly cor- 
alo Med warranted by objective data. 
omain ects are often seen in the evaluative 
Positive] so that a target person who is rated 
Positively on one trait is likely to be rated 
also req y on other traits, But halo effects may 
A ect more subtle rating biases that map 
rait Panera] connotative relations between 
Ories t or the implicit personality the- 
lieves ü d by the rater. Thus, if the rater be- 
to h hat people who are generous also tend 
simil, warm, he will rate the target person 
fact 7i on both dimensions—in spite of the 
ina at the target person may have behaved 
Warm and stingy manner. A different kind 
ena? chologic error may be seen in the rater's 
Tate rdg to restore balance in his ratings, to 
ore "lends of friends and enemies of enemies 
Of er Positively than they deserve. Both kinds 
Tor may be more common with paper- 
tre ot rating measures than with mea- 
refle that commit the subject to action or 
Ct an involving social relationship. Once 


"Bain, it is easier on such measure Bereit 
Tate people the way they should C987. 
M be sd rather than in line with one's 


true ; 
"€ inner feelings. 


Di j 
ficulties with Behavioral and 
Casures of Attraction 


E Strictly speaking, al 
Ogy involves measure 
€ can conceive of a © & Carl- 
quentiality for the subject (arg continuum 
Smith t one en E 
Would be 6a) actors that o dor is 
j long-range, 
Ject to costly, effort aa "important conse- 
er reason perso end might be the 
aire. Since truly con- 


quences, At the O 
Nonymous questio? measures are typically 


Sequential behavior ical to arrange in a psy- 
‘ficult and impr? investigators have often 
chology experime’ 'Aronson and Carlsmith 
Tesorteg to WD? apehavioroid? measures— 


(1969) have pr true behavioral measures 


« Behavioroid" 


surement in psy- 
t of behavior, but 
ontinuum of conse- 


] mea 
men 


distinguishe 


ln. 
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in that the subject commits himself to conse- 
quential action without actually going through 
with it. From the subject's point of view, a 
behavioroid measure is the same as a be- 
havioral measure at the time of commitment. 
If properly convinced, he truly expects to 
serve under the group member he votes for 
as leader (Schachter, 1951), or to escort visit- 
ing Negroes around campus (Marlowe, 
Frager, & Nuttall, 1965), or to have a large 
“drive carefully” sign displayed on his front 
lawn (Freedman & Fraser, 1966). The im- 
portant consideration is that he expects to 
have to live with the very real consequences 
of his behavior, and therefore is hardly likely 
to take his response lightly or be much af- 
fected by evaluation apprehension or experi- 
menter demand. 

The most obvious advantage of behavioral 
and behavioroid measures of attraction is the 
likelihood that the subject will respond in a 
way more readily generalizable to the involv- 
ing situations of real life. But there are dis- 
advantages as well. If the measure is be- 
havioroid, there is always the lingering suspi- 
cion that the subject does not really expect to 

his commitment. A more 


follow through on e 
important problem is that behavior (and be- 
havioral commitment) can be multiply or 


alternatively determined. In studies where the 

behavior itself is the sufficient focus of study 
(as in the effects of social pressure on volun- 

teering behavior, or the role of frustration in 

determining the sociometric rejection of ethnic 

minorities), this problem is not of overriding 
importance. When behavior or effortful com- 
mitment is used as an index of attraction, 
however, the problem may be acute. A re- 
quirement of sociometric measures, according 
to Moreno (1934), is that subjects make 
their choices fully understanding that they 
can lead to restructuring the group. The 
original sociometric test was, then, a be- 
havioroid if not a behavioral measure. But 
one can imagine many reasons that might 
underly a sociometric choice: The chooser 
feels that the group should be a mixture of 
good guys and bad guys, and since others ar 

likely to choose nothing but good sce, 1 re 
up to him to choose a bad one; the “aati 
feels ged for someone he expects eee aa 
reject (Bramel, 1969); the chooser may be 
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more fearful of the target of his choice than 
of others and be trying to appease him. 
Similar difficulties arise with other behay- 
loral measures of attraction, We may reward 
another, not because we like him, but because 
We feel sorry for him. We may shock the 
student in a teacher-student paradigm, not 


cause we think 
Profit by clear 
rculate petitions 
other person be- 
f-esteem to serve 
; rather than because we like 


Same meaning 
infer the same 
Sponse, There 
measure of at 
these indirect 
ministered as 


are many ways to disguise a 
titude or attraction. Some of 
Measuring techniques are ad- 
Part of a mild diversion, and 
Some are projective and involve displacements 
of stimuli or responses. As an example of the 
former, Students of interpersonal attraction 
Often stress their interest in how first im- 
pressions are formed, but develop indexes of 
attraction. from the first-impression ratings 
by adding scores on all the evaluative traits 
(e.g., Jones & deCharms, 1957; Jones, Stires, 


T o dif- 

Shaver, & Harris, 1968). This is we mensi 
ferent from the common procedure p. d 
ing attitudes toward an object by Mié real 
beliefs about it, Also in the attitu ose D 
similar but more clearly proleet YE m calle 
be made of what Campbell (1950) mple here 
“disguised-structured” tests. An exa measure 
would be the test that purports Uomain 9 
knowledge or information about a scored P 
attitudinally relevant facts, which » pias Í 
terms of the prevalent direction o 
the wrong answers given. ; ires v 

As an example of projective m pic 
attraction, Gerard (1965) used a Libo? to 
ture Impressions Test devised by tg? eval” 
obtain an indirect measure of eri situa” 
ations of other subjects in a sanoen. to four 
tion. The test involves writing pene E 
Stylized pictures showing an uon w B 
different group situations. These S appr ac 
then content-analyzed to determine the nain 
or avoidance reactions to others o rest ^ 
Character in each of the pictures. ; us avo! í 
ing algebraic sum of approach T ensure 0 
ance statements was used as à a nis Pree. 
each subject’s attraction to those in rd’ 
ent situation, with results confirming E. 
particular theoretical hypothesis. rojecti 

The problem with indirect or P re uir" 
measures of attraction is that they. itu? 
careful validation, and the pecu Re may 
tions in which they are eventually e oriel Ke 
differ in important ways from t 5 fairly 
validation settings. Even with tests stant 
high validity there is bound to be d rati m 
error variance contributed by idiosync lev? 
terpretations of the items or pictures, P ons? 
of verbalization, and by various 
Styles. 


; nd 
Physiological Measures of Attitude d 


Attraction he 


ft 
It is hardly surprising that form es ane 
Feasons implied by the above discussi nys” 
Vestigators have sought to discover Paua y 
Ogical indicators of attitude. It is me c 

assumed that such indicators are less ortio 
an verbal responses to voluntary disto well 
and defensive denial. Although they were nt 


? Libo, D. Measuring group cohesiveness: “ss, 


nii 
Arbor, Mich.: Research Center for Group Dyna” 
1953, 
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awar 
e of , 
previous unsuccessful attempts to 


"se th : 
ES C anie skin response (GSR) in the 
ampbell d social attitudes, Rankin and 
Gs -level 1955) obtained highly significant 
Spondin differences between subjects Te 
Negro E to contact with white versus 
Lott Gee Although Porier and 
finding 267) were unable to replicate this 
dia find ¢ a better controlled study, they 
tween z low but significant correlation be- 
Eur reactivity to Negro contact and 
"leur (eer scale score. Westie and De- 
Well as ae recorded finger-pulse volume as 
ho as subjects were shown pictures 
Teludiceq. and whites flashed on a screen. 
Slance subjects (as measured by a social 

"ection questionnaire) showed greater 
ictin S but lowered pulse rates to slides de- 
(1959) Negroes, In a similar study, Cooper 
Names oe that GSR magnitude to ethnic 
nese» ve “Mexicans,” “Swedes,” and “Japa- 
ice be predicted from verbal reac- 
t a the same labels. Hess (1965) reported 
With Pupillary dilation seems to correlate 

Positive affect. 

to and similar studies (cf. Leiderman 
piro, 1964; Shapiro & Crider, 1969) 
toual 5 autonomic indicators of affective 
may shed some light on the structure 
Patterning of attitudes when th 
rper, viewed in the con 
atte nental procedures and related to the 
Th of conventional verbal responses to 


e i A 
«C Same situation, As a direct measure of 


Tue”? : i i 
€" attitude, however, physiological Lon 
nd responsive 


ors are judged to be crude anc © 
E des many variables. For one thing, there 
able, Creasing evidence that physiological vari- 
ten, are affected by such cognitive factors as 
iss Uctional set, suggestion, and cognitive 
rj. nee. Gerard (1964) measured finger- 
e amplitude changes during a period in 
Dai ch subjects were to decide which of two 

intings they wished to have. By assuming 
ject pulse rate should be high when the sub- 
abl was experiencing dissonance, Gerard was 
d € to provide à rough physiological chart- 
son of the decision process featuring dis- 
jnance elevation and reduction, Zimbardo, 
Ohen, Weisenberg, Dworkin, and Firestone 
(1969) also showed that predicted dissonance 
arousal and reduction effects can be tracked 


“Xplor 


ator. 
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by changes in GSR. In one sense these studies 
emphasize the considerable promise of using 
physiological indicators to reflect cognitive 
changes, but they also suggest that it is mis- 
leading to contrast voluntary control over 
verbal responses with involuntary physiologi- 
cal indicators. 

A second and more obvious problem with 
the use of physiological measures of attitude 
and attraction is the difficulty of inferring af- 
fective content or direction from such mea- 
sures. As Westie and DeFleur (1959) noted: 


hat conspicuous physiological 
an S [subject] upon being 
exposed to a stimulus object indicates simple favor- 
ability or unfavorability toward the attitude object, 
regardless of his scale-measured attitudes toward 
that object. It is easy to fall into thinking, espe- 
cially when the stimuli are "race objects," that high 
response indicates a negative attitude [p. 3461. 


Shapiro and Crider (1969) raised a number 
of doubts concerning the use of physiological 
measures even in the detection of stress, 
arousal, tension, or activation. It is obviously 
hazardous in the extreme to make the still 
further assumptions that would be required 
to use GSR or pulse-rate measures as indi- 
cators of the direction and intensity of at- 


titude. 

For purposes o 
now to be described, h 
sideration is the plausibility of having physio- 
logical measures of one’s true attitudes and 
feelings. Here our task is greatly facilitated 
by public knowledge and stereotypes about the 
«lie detector” and its use in criminal investiga- 
tions. According to Sternbach, Gustafson, and 
Colier (1962), lie detection by polygraph is 
still in the state of very primitive art, suffused 
by unfounded and often fraudulent claims 
concerning reliability and validity. But Lee 
(1953) emphasized the preventive utility of 
lie detection devices and operators because of 
the fact that many would-be larcenists and 
perjurers believe in the mystique and infal- 
f lie detection. He retold the story of 


It cannot be assumed th 
changes manifested by 


f the bogus pipeline measure 
owever, the crucial con- 


libility o 
the prince who rounded up suspects and told 
them that each was to go into the next room 


plindfolded and pull the tail of the ass therein 
The ass would bray when his tail was pulled 
py the guilty party. The prince examined the 
hands of each suspect after they had toin 
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pleted, in turn, their quiet visit to the next 
room. The one suspect whose hands were not 
black was accused of being the guilty party. 
The ass's tail had been dusted with black 
powder, and the prince assumed that the guilty 
party would only pretend to pull the telltale 
tail. The moral of this Story for our purposes 
is that if people can be made to believe that 
there are devices reflecting their true inner 


attitudes or feelings, the measurement pos- 
sibilities are almost limitless 


VERSIONS OF THE Bocus PIPELINE 


The idea of exploiting false physiological 
feedback in a social psychology experiment is 
not new. Unlike the present 


proposed use of 
the bogus pipeline concept, however, earlier 


uses have manipulated feedback as an in- 
dependent experimental variable, Perhaps Ger- 
ard* was the first to see the possibilities of 
such feedback in controlling certain assump- 
tions that the subject was making about him- 
self and his Proclivities, Gerald attached elec- 
trodes to the forearms of subjects in a stan- 
dard conformity Setting requiring judgments 
of the larger of two multipointed stars. Each 
subject was led to believe that the electrodes 


measuring “implicit muscle movements” in 


r his overt, conscious 
preferences in the situation, 


meter in f, 
and Behar (1966) 


: Used a similar feedback 
technique to convi 


^ Gerard, H. B. Acts attitud 
(Symposia Study Series No. udes 


National Institute of Socia 
1960. 
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to 

they were or were not sexually o 
line drawings of a turtle, cup, flag, m nplifie 
rosebush. Valins (1966) played back nie they 
(fictitious) heart beats to subjects w Geral! 
observed slides depicting nude UST were 
and Rabbie (1961) and Behar (1967) ‘hoy 
successful in making subjects think. nticipd" 
were more or less fearful in a pue to 0b- 
tion setting where they were aee a a 
serve their own (bogus) GSR rea nforma- 
each of these studies, the feedback 7 endent 
tion was a major part of the in °F arious 
variable manipulation defining Ee was 
experimental conditions, The fee ory, af 
embedded in a reasonable cover Pspicion. 
there was no apparent problem of er a de 

We consider the bogus pipeline to than 2 
pendent variable paradigm, uem tee 2 
specific device or machine for meas the i 
titudes and feelings. As indicated 'onvincing 
troduction, the paradigm involves up pro 
the subject that there is a machine feeling’ 
vides valid indication of his true rodie 
hooking him up to such a machine, !! condi- 
ing theoretically interesting stimulis M. up 
tions to the subject while he is honna 
and finally asking the subject to estin expres 
machine reading in terms of a pe 
sumably paralleling the summary pe ex 
the machine, Thus far, we have condu ed- 
periments with two different kinds Fine 
back machinery that illustrate the S 
paradigm: the directional or “differe resent 
GSR and the electromyograph. we each 
Some procedural details and results A lity 
of these devices to demonstrate se get with 
and certain special problems associa 
their use, 


Á 
; z " rough 
Galvanic Skin Response “Breakthrous 


ucte 
Jones, Bell, and Aronson (1971) Ee eff 
an experiment investigating the gom liked ^" 
fects of opinion similarity and being WE Di 
the reciprocation of attraction. In ge Vk 
do we feel about similar people W r like 
versus dislike us? How about dissitis i" 
and dislikers? Jones et al. predicted jissimill 
Jects will be (a) more attracted to a € less at 
liker than a similar liker and (5) igs 
tracted to a similar disliker than 2 € o 1 
disliker, Their reason for predicting spec? 
main effect for dissimilarity was the $ 
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pe cance attached to expectations that 
a sd in a pleasant (being liked by 
( "ssimilar person) versus an unpleasant 
i ME a isliked by a similar person) direc- 
recip We should be especially inclined to 
ME ors aad when we have unex- 
ot T Y gained the admiration of a dissimilar 
disik and we should be especially prone to 
s € someone who dislikes us in spite of 
ii eons we share in common. When no 
o EN ae about being liked or disliked is 
ett ed, we should simply like the similar 
er than the dissimilar other. 
ject a experiment was presented to the sub- 
s as a study of the process of impression 
but nation, Through a series of complicated 
y Plausible procedures, each subject learned 
P another subject was similar or dis- 
ar in attitudes and then learned whether 
Wat pur subject liked or disliked her after 
in ing her being interviewed by the ex- 
ing, enter. Tn some cases (control condition), 
Dj; son about being liked was not pro- 
Subia. As a final step in the experiment, the 
» Ject observed a videotape of the other sub- 
een being interviewed. The interview had 
fede viously taped by an experimental con- 
Was ae At no point during the experiment 
he confederate actually present. 
vig, TOughout her observation of this last 
Qe tape, the subject was hooked up to a 
bee, Machine, the properties of which had 
den CXplained to her previously. The main 
op "dent variable measure of attraction was 
i, by asking the subject to try to pre- 
She tho, final meter reading of the machine as 
time Ought about the other subject. By this 
Subje of course, she knew whether the other 
: e was similar or not and whether the 
Inte, Subject liked her after observing the 
View, The machine was originally pre- 
Fal «d to the subjects as a "new kind of 
Mach’ skin response—you know, GSR— 
Vas Ine." The novel feature of the machine 
Pha allegedly the result of a “breakthroug: ia 
of en e t to measure not only the magnitude 
ive Stional arousal but to distinguish be- 
Few Positive and negative emotional states, 
‘he, Cn like and dislike. The subject was 
Noy Shown a standard GSR machine, and 
tention was drawn to the summarizing 
^t reflected an “integration” of the 


TABLE 1 


Estmates OF “Own GSR” ATTRACTION 
Reapinc (Experiment 1) 


l Being liked 
Attitude n m 
| Yes | No | Uncertain | Total 
Similar 1396 —642 1392 715 
Dissimilar 1852 — 38 981 932 
Total 1624 —340 1187 
Note.—The meter dial on which estimates were to be based 


ran from — 3000 to 3000. 


subject's feelings at a given point in time on 
a scale ranging from —3000 to 3000. 

A number of steps were taken to validate 
the meter in the subject's eyes. She observed 
the accomplice (again via prerecorded tape) 
apparently hooked up to the GSR while think- 
ing about her "best friend? and then while 
considering someone she disliked or was mad 
at, during which time there were appropriate 
movements of the visible meter toward the 
positive and then toward the negative end 
of the dial. She herself went through the 
same best friend, disliked-person procedure 
and, though not allowed to see her own meter 
movements, the experimenter assured her that 
the machine appeared to be in working order. 
Finally, in order to encourage subjects to do 
their best in estimating the meter readings, 
the experimenter stressed that the accomplice 
had been very accurate in estimating ker own 
meter readings, and the experimenter was 
convinced and hopeful that the subject could 
be accurate too. 

Although the meter estimation was the 
measure of the subject's attraction 
each subject was also 
pression question- 
f which consisted 
ting scales, each 
t-unin- 


primary 
for the accomplice, ! 
asked to fill out a first im 
naire, the major portion o 
of twelve 15-point trait ra i c 
expressed in antonym form (intelligen 
telligent, friendly-un friendly, etc.) ' 

The results of this experiment are discussed 
in detail in the original publication, and for 
present purposes their methodological im- 
plications are stressed. The mean estimates of 
“own” GSR attraction reading are presented 
in Table 1. Although the predicted interaction 
is only of borderline significance ($ < .07), 


the means are all in the proper order: If no in- 


TUA 
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formation about being liked is reported, the 
similar accomplice is better liked than the 
dissimilar one; if information about being 
liked or disliked is provided, the dissimilar 
person is preferred. Although in all cases the 
questionnaire followed the GSR estimate, and 
might therefore be expected to be somewhat 
contaminated by it, the impression ratings 
did not confirm the prediction or even fall in 
the right direction. One cluster of items de- 
signed to measure “social attraction” showed 
a general overall (nonsignificant) pr 
for the similar accomplice. With bo 
sures, not surprisingly, 
nficant preference for t 
liking accomplice. 

Tn a replication o 
a different accom 


eference 


th mea- 
there was a highly sig- 


he liking over the dis- 


f this experiment (with 
plice made up to be less 
ery strong preference for the 


tion measure and the social attraction cluster 
ratings yielded different results. This time, 
e the similar ac- 
] attraction ratings 
plice—regardless of 
subject. The meter- 
effects of similarity 
his failure to rep- 
s the meter-estima- 


ere the conventional 
Jones et al. (1971) 
on of the entire pat- 
estimation measure 
easure of how the 
ut the accomplice, 
e data seem to be 
more a reflection of how the subject thinks 
she should feel in such a situation. 

Such a conclusion is consistent with, but 
hardly forced by, the data of their two ex- 
periments. At a more conservative and more 
descriptive level, there is evidence that the 
meter-estimation measure and the attraction 
ratings are differentially responsive to experi- 
mental conditions, but have some proportion 
of variance in common. In the first experi- 
ment, for example, the within-cell correlations 
between the meter estimations and the attrac- 


ex 
econd 
tion ratings averaged .66. In the s$ 


jon W4 
periment, the average intercorrelatio t ings 
81. Thus it is clear that pm - 
are moderately congruent with -— may 
fective reaction, though these corre som - 
indeed be inflated by the aforemen es n 
founding of measure and a mete 
ministration. On the other hand, ent "a 
tern than the attraction ratings, e 
noted, and this suggests that the me some 2, 
tion technique, while more come n y 
deceptive than the questionnaire we p: 
tapping a different aspect (a truer d y SUP. 
attraction. This suggestion is aa b 
ported by the data from an PA 
Cooper (1971), to which we now 


issonanct 
Attraction as a Product of Disso 
Reduction 


ing the P 
Cooper was interested in imet ecision! 
oretical hypothesis that for poss te cons 
dissonance to occur, any VOT a 
quences must have been M ean e ie 
time of the decision, Cooper chose in à W 
hypothesis using female subjects ation a 
leged study of the effects of woes p 
competition on interpersonal ien game a 
designed a two-person cooperatlv turn ki 
Which each partner was to pd an Pr 
tempting to answer problems from anno m 
tude test. Each player was also et y corro" 
whether she thought she was proba jroble? 
or incorrect in her answer to each fia D 
An incentive system was explained tha wer” 
vided maximum credit for answers the ra 
correct and claimed to be correct prized E 
spondent. The system, in effect, ote tim 
team if either partner was que 
(failing to recognize her correct. um d 
excessively confident (claiming inco ae he 
swers to be correct). Some subjects WC" ye 


t 

0 

5) 
gn 


. an ^ e 
led to expect a timid partner, some s we 
confident one. Experimental subject „pe 


2 SSIP p 
given the option of exchanging the ét o 
partner for another randomly poss i 1 
(none did so); control subjects paret P. 
no such choice. Tn the play of the kc e 
jects learned, through controlled imf 
that their partner was in fact ve A e 
moderately timid, moderately overco x [i 
Or very overconfident. After playims 


Same 


HN 
leendent vari et was provided the main 
E partner, triable measure of attraction for 

is mai 

p estimates Measure was again in the form 
liring dino; readings of a GSR machine fea- 
tion, a as well as intensity informa- 
(Slo I S appeared at each experimental 
tered eg m of four to six, having volun- 
dcoblem sd an experiment on “cooperative 
vice aa It was explained that a new 
Ders ^ be used to measure how each 
Dlem-so} about her partner during the 
tibed as nh! game. The device was de- 
tie vse e differentiated GSR, whose prop- 
buteq 1 essentially the same as those at- 
71). P» the machine by Jones et al. 
je dating aee of the overall strategy of 
ecto 28 the differentiated GSR in the sub- 
1 


aq; Yes - 
jacent 2? they were ushered into à smal 
an impos- 


ic equip- 


ent room, When the el 


to the subject’s hand, she would pre- 


porate differ- 
posedly designed 
dback measuring 


subject 
s through 
nder the pre- 
ents in the 
r individual 
ach subject 
about the 
be nonaddic- 
ld be legal- 
as an 
minstered 


lugg Sults of each Su 
in, "onnaire befc 
Supe Unced over the 
Wes meter reading was so and so, on à 

€ from zero (meaning extreme negative af- 
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indicated 
sked the 


FI 


fect or disagreement) to 300 (which 
extreme positive affect). He then a 
subject, “Does this sound accurate to you 
Tn the vast majority of cases, the subject said 
yes and often expressed amazement at the 
accuracy of the meter. An occasional subject 
demurred and suggested that the meter needed 
a little adjustment. In such cases the experi- 
menter purported to adjust the base line and 
proceeded to a second opinion item, again 
announcing the machine's reading and asking 
the subject if this was accurate. In all cases 
this was sufficient. The experimenter re- 
peated this procedure with the second, third, 
fourth, etc., subject. Tt should be noted that 
this procedure uses each subject's acquiescent 
response as evidence to validate the discern- 
ment of the machine in the eyes of the others. 
The effect was very powerful, and the sub- 
ere characteristically quite surprised 
the experiment that the ma- 
ies were fictitious. 
validation procedures, the ex- 
bjects some crucial infor- 


mation about their partners to create the 
site expectations of timidity and over- 
d the problem-solving game was 
Each subject thought she was 
rimenter conveyed 


jects W 
to learn after 
chine’s propert 

After these 
perimenter gave su 


requi 
confidence, an 


then played. 
«Subject C," and the expe 
the same pattern of self- and partner feed- 


back in each session. By indicating whether 
or not the partner gave the correct answer, 
and whether or not she certified the answer 
as probably correct, the experimenter could 
create the experience of playing with a part- 
ner who was either on the timid or the over- 
confident side. 

At the end 
asked to estimate an 
the differentiated GS 
you think of your p 


of 15 problems, subjects were 
d record “what you feel 
R meter should read as 
artner." The subject's 
response at this point was the main dependent 
variable measure of the experiment, Subse- 
quent to this, subjects were also asked a 
series of questions about their partners’ intelli- 
gence, whether they would like to work with 
them again, Ge 3 

For à detailed presentation of the results, 
the reader i5 referred to the original article. 
Tt is sufficient for our present purposes to 
indicate that the theoretical predictions 
were confirmed in detail with the differ- 
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entiated GSR measure. In the choice condi- 
tions (where the subject could have asked for 
another partner), (a) when timidity was ex- 
pected, the extremely timid partner was liked 
better than a moderately timid partner; and 
(5) when overconfidence was expected, the 
extremely overconfident partner was liked 
better than the moderately overconfident one. 
In short, when the subject is personally re- 
sponsible for choosing to play with a partner 
possessing known liabilities, she likes the part- 
ner more when these liabilities turn out to be 
fairly extreme. This is in spite of the fact, of 
Course, that the extremely timid or the ex- 
tremely overconfident partner causes the team 
to lose more money than their moderate coun- 
terparts. This pattern of results was reversed 
when the subject did not have any choice in 
the matter, or when the subject expected a 
timid partner and got an overconfident one, 
or expected an overconfident partner and got 
a timid one. 

These results again provide a prima facie 
case for the utility of the bogus pipeline as 
a measure of attraction. just as surely, how- 
ever, they do not provide the crucial com- 
parative information that such measures are 
better than a direct attraction question. Since 
Cooper was primarily interested in the theo- 
retical question, the role of personal responsi- 
bility in dissonance reduction, he did not run 
Control subjects whose feelings were assessed 
by standard rating techniques. The data he 
obtained from the subsequently administered 
questionnaire provided several indicators that 
would normally go along with an impression 
of attractiveness (e.g., “enjoy working with"), 
"These verbal questionnaire responses did not 
confirm the dissonance prediction. Even in 
the choice conditions, subjects did not express 
any particular desire to work further with the 
extremely timid or overconfident partner, 
though the differentiated GSR showed their 
attraction to such a partner when her proclivi- 
ties were expected. Since the verbal measures 
followed the meter-estimation measure, how- 
ever, we cannot be sure that the former were 
not in some Way contaminated by the latter. 
A more direct test of the difference between 
the bogus pipeline and standard rating pro- 
cedures was of major interest in two studies 
by Sigall and Page, now to be reported. 


ions to 
Bogus Electromyograph and Reactto 
Stigma 


" irectly 

Sigall and Page were interested Ww. ndard 
comparing the bogus pipeline wiae ae ; 
rating procedure. To this end oy expect? 
situation in which subjects might T 
to distort their responses under nor ts are 
conditions. As noted earlier, subjec ddi- 
luctant to be negatively valvati tt 
tion, Goffman (1963) has nicely of in ; 
the awkward and affected — à an 
action with stigmatized others. ith an 0% 
Page confronted female subjects W ud T 
noxious male stimulus person Mt T. 
sented as either stigmatized or Te sti aol 
assessed subjects’? reactions to Mein on 
person under bogus pipeline or rà a 
tions. , Each fi 

The procedure went as folion: posi E. 
male subject, and male accomplice P in 
a subject, reported to a waiting ™ lice € 
psychology laboratory. The casei 
wore massive leg braces or appe? 


d 
"reti 
physically normal. A posted sig? 


s ther -om 
subjects not to talk with one ano 00 


LH t 
ments later a telephone in the sun. u 
began to ring. When it became SRE xa pic 
no one would answer it, the confeder rude M. 
up the phone and was extremely S oy 
gratuitously abusive to the a hof", 
dently had dialed a wrong num vedi 
thereafter the experimenter appe ush 
ingly unaware of the phone call. sma 
the subject and confederate into pu resp? d 
perimental room and asked them tha og 
to a short opinion questionnaire n 


Sisted of five innocuous items ep. "d 
movie and music preferences. sects Tine 
menter then left the room, and su Us sect 
cated agreement or disagreement 


EET 
3 ^ e 

acted as if he were doing the o Y 
he was copying the subject’s respons m^" 


oP” 4 
yan 5 
the experimenter returned he “S int? 4 


im è e 
selected the subject to accompany E d 
Second experimental room. She buo oft 
to bring her questionnaire with her. ns 


co 
The subject was seated before 4 V 


x 


the 
gon ae 

5 Sigall, H., and Page, R. tengan os 
pression of affect. Manuscript in prepara 


al rating 


n ented 
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Ateas 
ae Wheel extending from the console 
Tom pro along a 7-point scale ranging 
| On the c to 3. The drawn scale was mounted 
end of oe and there was a slot at each 
Placed ie scale in which a label could be 
Console pen metal box sat next to the 
a meter his box, labeled “EMG,” contained 
ne on vim a 7-point scale identical to the 
. Marked FS console. The meter itself was 
| Were ¢ MG Output.” Two skin electrodes 
Cables Eure to the box by cable, and 
lookin also ran from the box to impressive- 
The electrical junk. 
as an Foe. ipae described the apparatus 
Mlaineq | ere electromyograph. It was ex 
8 Do 4 an electromyograph (EMG) was 
i plicit of long standing, used to measure 
muscle potentials. The experimenter 


Note : 
d that standard questionnaires used in 
a variety of 


developed 


md EMG was an important break- 
Probe that circumvented many of the 
Ject haa He explained that when the sub- 
Were eld the steering wheel, and electrodes 
Poa seed to her forearms, the EMG 
to an pick up her tendency to turn the wheel 
e -7 Point on the scale. She was told that 
th. 8ctually did not have to turn the wheel, 
Ov, the EMG screened out gross muscle 
B ns and recorded the "first, undis- 
Scrip, reaction, The electrical junk was de- 
the ed as a small computer, which integrated 
oue ctrophysiological input, and the EMG 
Bu meter would reflect a close ap- 
t imation of that integration. Similarities 
Point lie detector were noted, and it was 
Ed out that the EMG was an improve- 
UON the lie detector in that it was 
E ive to direction as well as intensity of 
Donses, 
he experime 


Sy, ; 
Chological research involved 


difficult 
culties and that the newly 


Oc nter then told the subject that 
i “asionally pase-line responses differed among 
quviduals, DA Dic mcase eon adjustment of 
S € EMG would be required. To check on the 
jbject's base Jine, they. would go over the 
tems on the questionnaire and look at the 
Srrespondence petween it and the EMG 

wheel was locked in place, 


fadi he 
«Adings. T disagree" labels were inserted 


. agree” and 
M the console; and the electrodes were at- 
tached to the subject. She was to listen to 


p = 
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the statements as they were read, to attend to 
the drawn scale, and to look at the EMG out- 
put readings if she wished. The experimenter 
read the first item and flipped a switch on 


the computer. The computer’s lights flashed, 
and a mechanical noise was emitted. After a 
ed, and the needle 


brief period the noise ceas 

on the output meter swung to a number. Al- 
though these events appeared automatic to 
the subject, they were manipulated by the 
accomplice in the adjacent room. The ex- 
perimenter continued by presenting the re- 
maining items in similar fashion. On one item 
the experimenter suggested that the subject 
could, if she wished, try to fool the EMG 
either by exercising gross muscle movements 
or consciously concentrating in the direction 
opposite to her true position. The subject was 
reminded that since the EMG recorded im- 
plicit muscle movements, a reflection of her 
first, undistorted reaction, such strategy 
should not be successful. After all items had 
been completed, the subject was asked to 
produce the questionnaire, and the two sets 
of responses were compared. Of course, the 
EMG had been accurate in every case. 

The experimenter then turned to the *pur- 
pose of the present experiment." He said that 
the study was concerned with social percep- 
tion and that more information would be dis- 
closed later. At this point he needed to get 
some information from the subject regarding 
her current perceptions about herself, the ex- 
perimental situation, and the other subject. 
He indicated that while he would obtain this 
information directly from the EMG, he was 
d in discovering how sensitive 
heir own feelings— whether 
w they felt." Thus, 


ject 
dict what she 


It was explaine i c 
uld be maximized if she relied on her un- 


d reactions. He also told her that she 
how well she had predicted after all 
had been administered. The experi- 
resented the items, recording the 
ediction after each one, and pre- 
tending to note the EMG reading as well 
The crucial items asked for the subject’s 
evaluations of the accomplice. The subject 


wo 
distorte 
could see 
the items 
menter 

subjects PT 
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indicated her personal liking for the con- 
federate, and her agreement or disagreement 
as to the applicability of various traits, such 
as “obnoxious,” "pleasant," or "pushy" to 
him. 

In the rating conditions the computer and 
EMG were in view, but turned off. The wheel 
was free to be turned, and the electrodes were 
hidden. The subject responded by turning the 
wheel and orally stating the number to which 
the pointer had been moved, It Was explained 
that concentration and attention remained 
higher when a physical task was involved 
than when a paper-and-pencil questionnaire 
was used, The remaining apparatus was de- 
Scribed as a small Computer used for data 
analysis. As in the EMG conditions, subjects 
were asked to present first reactions, not to 
distort responses, and to be honest. 

A detailed accounting of results is in prep- 
paration (see Footnote 5). Here we simply 
note some of the findings, All subjects were 
convinced that the EMG did in fact Possess 
the powers attributed to it. As expected, sub- 
jects in the EMG conditions tended to be 
unfavorable in their evaluations of the con- 
federate, while those in the rating conditions 
tended to be favorable, This was manifested 
in relative terms and also reflected by the 
fact that on a Scale which ranged from 3 
to —3, mean responses in the rating condi. 
tion were different in algebraic sign from 
mean responses in the EMG condition. The 
EMG Subjects said they did not like the 
stimulus person; rating subjects said they 
did. He was evaluated as pushy, obnoxious, 
and insensitive in the EMG condition, but 
not in the rating condition. In sum, EMG 
subjects seem to have responded reasonably 
to the staged obnoxiousness of the accomplice, 
whereas subjects in the rating conditions ap- 
Pear to have been overly generous, 


Sigall and Page also expected a greater 
discrepancy in evaluation between measure- 
ment conditions when the confederate was 


stigmatized. They felt that most subjects 
would be hesitant to derogate someone with 
a stigma and that therefore especially high 
evaluations would result in the stigma-rating 
condition. Subjects in this condition did pro- 
vide the most positive responses, and, al- 
though the form of the interaction was 
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nifi- 


: istical sig! 
realized, it failed to reach statistica that 


cance. This leaves open the possibile n 
the EMG simply elicits more d 
Sponses than usual rating proced m son 
final experiment reported here provi n alte 
information on the viability of such 2 
native. 


New Look at Stereotypes ted DY 

(ted. P 

One of the findings recently 71969) in 

Karlins, Coffman, and Walters Fas that 

their study of social — E favo” 
white, American subjects provide 


e 
mor 
5 » an 
able stereotypes of “Americans Ec wae 
favorable stereotypes of “Negroes denyin® 
i 


the case in earlier research. without d n 
that those data may reflect a real E t that 
attitudes over time, it is quite s 1 
Such findings may be partly inf "Amer n 
social desirability. Self-criticism by entime? 
and a tendency to express positive "henomel 
about Negroes seem to be active p te asilY 
on many campuses. Such a n n 
could contribute to response distor 
study of stereotypes. A pag? 
A second experiment by Sigel R istor 
(1971) attempted to examine possi s totis d 
tion by using the EMG. A 2 x 2 ri 60 whit’ 
sign was employed in which half 0 acte jst! í 
male subjects indicated how cha was 9 
each of a series of adjective am j n 
"Americans"; the remaining subjec variaba 
same in response to “Negroes.” Las D 
Was crosscut by assessment opa r^ E. 
the previous study, half the su Yeading"; 
sponded by predicting their EMG standa" 
whereas the other half filled out $ ó 
rating forms. : a the EM! 
The procedure used to “validate used ^, 
was essentially identical to the pr ur 
the experiment on stigma. Me d 
there was no need to expose Lue uc 
& confederate, one variation was "- out es 
When the subject arrived, he ap ie 
small questionnaire. He was then pue n 
the experimental room and seated a b 
console, with his back to the drs pE 
room, The completed questionnaire W 45-19 
on a table near the door, which Y p d 
Slightly ajar so that an accomplice ubje i? 
Corridor secretly could copy the S 
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answ : 
entro] rose accomplice then went to the 
readings du Xs able to manipulate the EMG 
Ject could Heg validation. As far as the sub- 
is qu ell, no one had the opportunity to 
fter mper 
lon in e IB: (including valida- 
asked qo “MG conditions), subjects were 
‘Americ think about “a group,” either 
acteristic. ( or “Negroes.” The labels “char- 
Were ins (3) and “uncharacteristic” (—3) 
the m in the slots on the console, and 
e at x une E proceeded to read 22 traits, 
dee time, 
Page fae report is presented in Sigall and 
More stri we Here we summarize some of the 
alfected Uni results. Some traits were Un- 
Vieldeq y measurement technique and 
For ey; only a Negro-American main effect. 
Ore Xample, Americans were described as 
Negros. cheal and more materialistic, while 
M Ny Were described as more musical. 
Ogica] interesting, especially for methodo- 
considerations, were those traits that 


l 
resul 
ted in ; E 
t d in interactions. These traits tended 
he form of 


the me iiy affect-laden, and t 1 
; mei, pa was consistent over traits: 
1 the ans" were described more favorably 
3 Egr EMG than in the rating condition; 
the ho. s" were described more favorably in 

ing than in the EMG condition. Two 


T 
m n 
OD serve to illustrate. On a 7-point 
at extended from —3 tO 3, “honest 
60) of Amer 


Was p : 
‘tang €ported as characteristic 
5 in the EMG condition bu 


eristi A : 
Stic (—.27) of Americans Jn © 
as viewed as 


nar. 

m dition, while the same trait W 
kya acteristic (—.33) of Negroes in the 
th Condition but characteristic (.67) m 
tating condition. This interaction was 
was 


hie 

rene significant (p < -001)- 

x E.M as uncharacteristic (—.80) of Amer- 

isti, (n the EMG condition and uncharacter- 

tion (—.60), but less so, in the rating condi- 

Neg, ‘Lazy” was uncharacteristic —.73) of 
Broes in the rating condition, but char- 
stistic (60) in the EMG condition. 

k *garding the question of whether the 

it : simply elicits more negative responses; 

Should be noted, that one of the groups, 

Mericans, weceived more favorable ped 


a i " : 
fons in the EMG than in the rating condi- 
n, 


T AND ATTITUDE 361 


Discussion and Conclusions 


The examples reviewed in this report sug- 
gest a few of the various uses to which the 
bogus pipeline paradigm can be put. We wish 
to emphasize the rather unrestricted adapta- 
bility of the paradigm and to reiterate that 
particular procedures and rationales employed 
will vary with the investigator’s interests and 
As we penetrate more and more 
ayman’s assumptions about 
it may be important 


resources. 
deeply into the ] 


physiological processes, 
to distinguish between bogus autonomic indi- 


cators and those alleged to reflect implicit 
muscle movements. For the moment, we are 
inclined to treat these as sufficiently equiv- 


alent to lump them together as one general 
paradigm. 
While the ab 


and suggestive, 
cise information 


ove examples are encouraging 


they do not provide very pre- 


concerning the particular 
f using bogus 


mensurational advantages 0 
physiological indicators. This is especially 
rst two studies where there is 


true of the fi y 
no direct comparative information on the re- 


lation between meter estimates and straight- 
forward verbal measures of attraction. The 
meter-estimate measure seems to give results 
that are more in line than the questionnaire 
measure with theoretical predictions of vari- 
ations in underlying attraction, but there is 
no way of knowing whether this is because 
the meter-estimation measure is more pene- 
trating Or because it was administered first 


and therefore assigned greater importance by 
the subjects. The two Sigall and Page studies 
do provide dire 


ct comparative information, 
but it is far from entirel 


y clear just what is 

being compare 
One of the strongest reasons for preferring 
a bogus physiological measure to a question- 
aang is precisely that the rating is 
able to so many extraneous 


lly vulner: 
ctual influences. Tt seems probable 


naire ! 
potentia 
and artifa u 
that different wordings and contexts of ad- 
ministration would produce rating results dif- 
fering from each other and showing greater 
or lesser differences from meter-estimation 
measures. We know from the Sigall studies 
that electromyographic reading estimates dif- 
fer systematically and understandably from 


paper-and-pencil ratings made on a scale with 
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the same range and verbal label. But we do 
not know if the observed differences would 
remain or be as strong if, instead of using a 
standard paper-and-pencil rating as our com- 
parison standard, Sigall had taken his sub- 
jects aside and said, “Look I want you to be 
completely candid and tell me what you 
really think about that guy,” or “Give me a 
rating that reflects your gut reaction” (cf, 
Aronson & Linder, 1965). 

This discussion moves us toward the more 
fundamental question about why the observed 
differences between verbal ratings and bogus 
physiological measures occur. Here we face 
a number of reasonable alternatives. The first 
and most straightforward of these is that 
bogus pipeline measures operate like lie de- 
tection devices in facilitating scrupulously 
truthful reporting. To embrace this alternative 
is to accept the assumption that subjects do 
not typically tell us the truth when they re- 
port their attitudes and feelings about sensi- 
tive matters, or those where private feelings 
might be expected to differ from a socially 
respected norm. This is not an altogether 
unreasonable assumption—especially if we 
remember that all it takes is a few subjects 
who conceal the extremity of their feelings in 
the rating condition to produce a reliable dif- 
ference between condition means, 

A variant of this altern 


breference for negative o 
Let 


ative is to posit a 


icular rating point (cf, 
Part of this in. 
the subject’s aware- 
ness that his feelings about persons or his 
o vacillate over 
with mood, cir- 
n asked to esti- 


g relatively more 

: affect. All that is 

] »1$ to make the one further as- 
sumption that the person y 


; 4 vould rather err on 
the side of being too harsh than too lenient, 
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ample; 
To take the stereotype study as NS 
it may be better to admit being a is fairly 
have it shown that you are rea ile peing 
liberal than to claim tolerance p. 
revealed as a bigot. The sies, for 
hedges his bets, making it less likey odi 
latter, less preferred, error to E that 
whole, this is an interesting altern thus fä 
is consistent with most of the data unt fof 
collected. However, it does not aec E in 
the fact that subjects were more pc was 
describing Americans when the d favor 
employed—unless one assumes tba 
able rating of Americans is an ind 
negative attitudes toward others. tible with 
A third alternative, not pcm E s 
those already mentioned, is ies affecti" 
physiological apparatus makes PUN salien 
component of an attitude or evalua s 1i you 
over the more cognitive compon 
ask a person, “What do you thin : 
respond differently than when m 
“How do you feel?” Similarly, omic 
subject to predict what his auina o foct* 
vous system is telling the mache comp? 
him on the more primitive, affect? struct 
nents of the impression or attitude So ifie 
The resultant response may pee pont 
from the verbal rating that is equa ns of Ut 
but that reflects different qeu rue g 
attitude being measured. If this W iu vite e 
would cast the stereotype findings aed tie 
different light than if one were be jer ie 
lie detection emphasis of the first F pat ot 
It is easy to jump to the concluso im 
White subjects in the BUMEDHIPL oe Ht 
are really less tolerant and more. is cert?! 
they profess to be. Although this Je tha ent 
conceivable, it is equally conceivab giffe" 


e 
ji o 
subjects are trying to cope with T poth: Ts 
fasks and responding honestly in ect 
Sardless of our own particular rà ul 


We have all heard much abont 
and probably have well-developed hat Pipe 
about the actuarial likelihood out’ i 
have deep-seated negative feelings vom ti 
racial groups of which they s affe" vc, 
unaware. When asked to estimate t m 
component of their racial attitudes, he. y 
it is not altogether surprising ihat licatio" D 
accede to some extent to the imp natt" 
this stereotype concerning the 
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des A similar argument could be de- 
of esl concerning the actuarial likelihood 
e den negative affect in cases where at- 
Ate lon is being measured. People may have 
Ind of id-ego conception of their attitudes. 
ed may assume that their ugly, negative 
ister are buried in the autonomic nervous 
p and are expressed only in offguard 
Een d that portion of the attitude that 
d rols behavior, on the other hand, is tied 
= osely to conscious, rational processes in the 
Intellect, 
p it is impossible to choose among these 
e natives with our present available knowl- 
den] it is important not to embrace too 
ad the proposition that the bogus pipe- 
1S necessarily a better or truer measure 
ae attitude than the standard devices and 
Pecially that physiological estimates are a 
tio, Predictor of overt behavior than ques- 
m Maire measures of attitude and attraction. 
ES ie to the stereotype study, the results 
Ein necessarily imply that someone who 
m ates a negative reading to the stimulus 
E p Will therefore be likely to discriminate 
indivi Negroes or to react with hostility to 
tude ual blacks. Undoubtedly, different atti- 
5, and attitudes held by different persons, 
ifferently structured and forecast overt 
ior with differential precision. It is quite 
€ that the more cognitive questionnaire 
Ponse to an item about race reflects the 
a person wants to see himself and the 
which he tries to behave. He may have 
© reservations about how his autonomic 
Neverthe system feels about Negroes, but 
Mort eless feel that he is in all the more 
temaj ant respects an unprejudiced man. t 
bogus t0 be seen, certainly, whether the 
y mu Dipeline is any more behavioroid than 
Con og, onnaire response, though in many 
trans it should be freer of misleading 
y Sous determinants. 
‘üg 5» different alternatives discussed à 
alten st different research programs. The first 
Et ative; that the bogus pipeline provides 
Verso " measure than other methods of how a 
m e honestly feels, leads to experiments 
me ate, action of feelings in highly senst- 
tet ilie or after interpersonal experiences 
Shs, Use unpleasant or embarrassing emo- 
he final alternative considers the bogus 


a 


Som 


bove 


pipeline as a method tapping a different 
aspect of attitude. To accept this seriously 
could lead in the direction of research on the 
structure of different kinds of attitudes and 
to an exploration of meaningful discrepancies 
between cognitive and affective dimensions as 
a function of issue and setting. It is our 
belief that research along both of these lines 
would prove interesting and constructive. By 
either avenue we shall begin to learn more 
about the multiple processes of self-attribution 
and the nebulous phenomenology of emotional 


experience. 
REFERENCES 


Aronson, E., & CartsmitH, J. M. Experimentation 
in social psychology. In G. Lindzey & E. Aronson 
(Eds.), Handbook of social psychology. (2nd ed.) 
Vol. II. Reading, Mass.: Addison-Wesley, 1969. 

Aronson, E. & Lryver, D. E. Gain and loss of 
esteem as determinants of interpersonal attractive- 
ness, Journal of Experimental Social Psychology, 
1965, 1, 156-172. 

Bemar, L. B. The effects of cognitive dissonance on 
inappropriate emotional reactions. Journal of Per- 
sonality, 1967, 35, 505-519. 

Bramet, D. A. A dissonance theory approach to de- 
fensive projection. Journal of Abnormal and Social 
Psychology, 1962, 64, 121-129. 

BRAMEL, D. Interpersonal attraction, hostility and 
perception. In J. S. Mills (Ed.), Experimental 
social psychology. New York: Macmillan, 1969. 

Breum, J. Wọ, & BEHAR, L. B. Sexual arousal, de- 
fensiveness, and sex preferences in affiliation. Jour- 
nal of Experimental Research in Personality, 1966, 
3 s nd T. The indirect assessment of social 
AMPBELL, D. T. The in 

ko ro Psychological Bulletin, 1950, 47, 15-38. 

CAMPBELL, D. T. & Fiske, D. W. Convergent and 
discriminant validation by the multitrait-multi- 
method matrix. Psychological Bulletin, 1959, 56, 
81-105. 

Coox, S. 
approach 
cm Personal and dissonance. 

E ae of foreseen consequences. Journal of 

Personality and Social Psychology, 1971, 18, 354— 


C. A multiple-indicator 


, & SELLTIZ, t 
I asurement. Psychological 


to attitude me 


1964, 62, 36-55. —— 
1 responsibility 


Eo pR Emotion in prejudice. Science, 1959, 
, J. B. 


314-318. 
ge P., & MARLOWE, D. The approval mo- 
ne, D. 


s. New York: Wiley. 1964. ; , 
tive. New "I The social desirability variable in 


ARDS, G 
poon assessment and research. New York: 
ji 1957. , . 
nm J., & FRASER, S. C. Compliance without 
Put: The foot-in-the-door technique. Journal 
of Personality and Social Psychology, 1966, 4, 
195-202. 


364 


Gerard, H. B. Physiological measurement in social 
Psychological research. In P. H. Leiderman & D. 
Shapiro (Eds), Psychobiological approaches to 
social behavior. Stanford: Stanford University 
Press, 1964. i 

Gerarp, H. B. Deviation, conformity, and commit- 
ment. In I. D. Steiner & M. Fishbein (Eds.), Cur- 
rent studies in social psychology. New York: Holt, 
Rinehart & Winston, 1965. 

Geraro, H. B, & Rass, J. M. Fear and social 
comparison. Journal of Abnormal and Social Psy- 
chology, 1961, 62, 586-592. 

GorrMax, E. Stigma. Englewood Clifís, N. J.: Pren- 
tice-Hall, 1963, 

Hess, E. H. Attitude and pupil size. 
American, 1965, 212, 46-54. 

Jones, E. E, BELL, L, & Aronson, E. The recipro- 
cation of attraction from similar and dissimilar 
others, In C. C. McClintock (Ed.), Experimental 
social psychology. New York: Holt, 1971, in press. 

Jones, E. E, & DECHARMS, R. Changes in social 
perception as a function of the personal relevance 
of behavior. Sociometry, 1957, 20, 75-85. 

Jones, E. E., Stres, L. K., Snavzn, K, G., & Harris, 
V. A. Evaluation of an ingratiator by target 


persons and bystanders. Journal of Personality, 
1968, 36, 349-385. 


KARLINS, M., COFFMAN, 


Scientific 


T- EE Watters, G. On 

the fading of social stereotypes: Studies in three 
generations of college students. Journal of Per- 
sonality and Social Psychology, 1969, 13, 1-16. 

Lre, C. D. The instrumental detection of deception. 


Springfield, Ill.: 1953. 
LEIDERMAN, ds.) Psycho- 


biological or. St: A 
dE r. Stanford: 


R,& 

mitment to action taken Mee i vua 

nitive dissonance. Joy: 

Psychology, 1965, 2, 864-868. 
McGune, W. J. The nature of attit 

change. In G, Lindzey & 

Handbook of socia 

III. Reading, Mass.: Addison. 
Pina, J. L. Who 


udes and attitude 
E. Aronson (Eds.), 
- (2nd ed.) Vol. 
-Wesley, 1969. 
shall survive? Washington, 
Mental Disease Publishing 


Psychology of the psy- 
h particular reference to 
CS and their implications, 
t, 1962, 17, 776—783. 


xperiment: Wit 
demand characteristi 
American Psychologis 


EDWARD E. JONES AND HAROLD SIGALL 


tod onse 

Porter, G. W., & Lort, A. J. Galvanic MA 
and prejudice. Journal of Personality a 
Psychology, 1967, 5, 253-259. ic skin 

RANKIN, R. E, & Campsert, D. Ts: d Jour- 
response to Negro and White experiien 1935, 51 
nal of Abnormal and Social Psychology, 

30-33. 

Rrecken, H. W. A program for T x ss 
ments in social psychology. In N. "Vol. 2. 
(Ed.), Decisions, values, and groups. ; 
York: Pergamon Press, 1962. ils; On elimi- 

RosENBERG, M. J. When dissonance fail Gunde mea- 
nating evaluation axpienenston Ank 7 Social Psy 
surement. Journal of Personality an 
chology, 1965, 1, 28-43. ; m 

Scnacuter, S. Deviation, rejection, ana, SUR 
tion, Journal of Abnormal and Socia 
1951, 46, 190-207. TT 

Scorr, W. A. Attitude measurement. ci y 
& E. Aronson (Eds.), Handbook ond Addi- 
chology. (2nd ed.) Vol. II. Reading, Ma b 
son-Wesley, 1969. :ological ap 

Sumarmo, D., & CRIDER, A. Psychophys ey & H 
proaches in social psychology. In e 1 psycholog”: 
Aronson (Eds.), Handbook of Roe Addiso" 
(2nd ed.) Vol. III, Reading, Mass 
Wesley, 1969. -y gudgntt ; 

Smer, M., & Hovrawp, C. I. Social Pica ii, 
Assimilation and contrast effects in 6 Univers! ) 


and attitude change. New Haven: Yale 
Press, 1961, 


n exper 
purne 
New 


munica- 
nology 


The 

Sicut, H. ARONSON, E, & VAN Hon a nal D, 

Cooperative subject: Myth or A 6, 10. no 
Experimental Social Psychology, 1910; nit 


Sicart, H., & Pace, R. Current stereotypen ity an 
fading, a little faking. Journal of Tert 3 
Social Psychology, 1971, 18, Ww. R. A 

STERNBACH, R. A., Gustarson, L. A., & " usi 
Don't trust the lie detector. Harva d 
Review, 1962, 40 (6), 127-134. trate ro 

Vatins, S. Cognitive effects of false pon syo” 
back. Journal of Personality and Soc "m 
08y, 1966, 4, 400-408. one ios, 

Weste, F. R, & DrFrrum, M. L. A e^ 
sponses and their relationship to ra 1 


p »cholog' 
Journal of Abnormal and Social Psych í 
58, 340-347, s 
ZIMBARDO, P. G., Comex, A. R., Wee controle 
Dworkin, L, & Firestone, I. 3 (Eds gi: 
experimental pain. In P. Zimbar Glenview? 
cognitive control of motivation. 


Scott, Foresman, 1969. 


(Received April 10, 1970) 


Psychological Bulleti 
1971, Vol. 76, No. 4 365-377 


MEASURES OF RESPONSE AGREEMENT FOR QUALITATIVE DATA: 
SOME GENERALIZATIONS AND ALTERNATIVES 


RICHARD J. LIGHT 


Harvard University 


Various procedures are available for measuring agreement among two or more 
observers who classify responses among nominal categories. But different problem 
situations naturally require different measures. Using the general model of a con- 
tingency table with fixed margins, this study suggests a measure of level of agree- 
ment among several observers when they are compared “internally”; a conditional 
measure of agreement for several observers compared “internally”; a test for the 
ral observers when their categorizations are compared with 


joint agreement of seve 


a “standard”; and a statistic for evaluating the pattern of a; 
ervers, The first two of these measures are extensions of Cohen's « statistic. 
h situation, and results of a Monte Carlo study 


two obs: 
Illustrations are presented for eac 


of the behavior of the pattern agreement statistic 


p in psychological experiments two or 
ü um servers each assign responses to one of 
assion of nominal categories. Assume these 
E are made independently. A re- 
EN may then wish to develop a measure of 
wheth nen for these observers and to test 
8e ned us measure differs significantly from 
evel ce” agreement, or any other prespecified 
a 3 bin of recent studies have suggested 
an "Ee for studying observer agreement, 
First Tom these studies several issues emerge. 
ip b What is the relationship between measur- 
| ie utm and measuring agreement? 
dition k should measures of agreement be a 
ties? me observed marginal prokaht i- 
Perea, Se can the traditional eil 
Bree, be fruitfully applied to measuring 
ent? 
Wes this study we will consider these iss 
ane! argue that agreement isa special case 
tv SSociation, We will then distinguish several 


eq ES of agreement and suggest that each type 
: measurement 


uir s 
Doe a different sort of ) 
foy a dure, Finally, we will present extensions 

Nd alternatives to techniques currently 


Use, 
for a measure agreement. Structuring data 
Cong; * Agreement problem in the format of a 


wep Eency table enables us to utilize some 
Down results to develop these measures. 

he Th Observer assigns each of 7 objects to 
Us, Reg C nominal categories, then the contin- 
She Pests for reprints should be sent to Richard J. 


"yg, Oratory of Human Development, Harvard 
Y, Cambridge, Massachusetts 02138. 


ues. 
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greement between 


are discussed. 


gency table of Figure 1 is specified. The 
categories on the rows must appear in the same 
order as the categories on the columns, which 
permits us to define agreeing items as all those 
items that appear on the main diagonal of the 
table, since both observers assign such items to 
the same category. Goodman and Kruskal 
(1954) described this case as appropriate for 
measuring reliability. Note that two rows 
cannot be permuted unless the corresponding 
columns are similarly permuted. Thus, the 
ordering of categories on rows and columns 
must be identical. 

While the layout in Figure 1 is most fre- 
quently interpreted as two observers’ cate- 
gorizations of items, it can be viewed 
alternatively as each of 27 observers! cate- 
gorizations of a single item. In this latter case, 
n observers of one type (e.g., mothers) would 
n of a second type (e.g., fathers), 
e 2n paired observers assigns an 
this case, 
might be 


be paired with 
and each of th 
item to one of C categories. For 


rather than measuring reliability, we 


Observer 2 


Observer 1 Nj 


Fic. 1. Notation for two-observer agreement. 
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Mothers 
1 2 3 
1 50 50 
Fathers 2 50 50 
3 50 50 
50 50 50 150 


Fic. 2. Responses with perfect. association, 
but zero agreement. 


interested in estimating an overall level of 
agreement "within families." 

For the special case of C = 2 categories, 
Measuring agreement in the resulting 2 X 2 
table is equivalent to measuring association. 
Thus, chi-square-based measures such as the 
contingency coefficient and phi, or other mea- 
sures such as Q, may be used (see Kendall & 
Stuart, 1967). But when C exceeds 2, the 
standard measures of association may be 
inappropriate for measuring agreement. 

The distinction between agreement and 
association for nominal data is that for two or 
more responses to agree, they must fall into 
the identical category, while for two responses 
to be associated requires that the category of 
one response can be predicted from knowledge 
of the other’s Category. Thus, agreement is a 
Special case of association, and a table may 
ation, as 
ement and high association, 

t illustrate this distinction, 
Figure 2, a presentatio; 

150 mother-father Pairs in answer to a single 
multiple choice question with nominal alterna. 
tives 1, 2, and 3. As only Tesponses on the main 
diagonal indicate agreement, Figure 2 indicates 
no agreement; yet, there is perfect association, 
Given the father's responses, the mother’s 
responses are perfectly Predictable, and vice 
versa, although within-family 


consider 
n of paired responses of 


agreement is 
Zero, 
Country | Country 2 
Moe Mothers 
1| 60 | i : 
| 20 | so il 2 | 
Fathers BPS ara) Fathers L9 pe» ba 
2| 20 | 0 | 20 2 0 20 20 
2 2 2 
————Ó— —LRE Maa 
80 20 | 100 20 80 | 100 
Fx6,.3; Example illustrating difference between absolute 
versus re] 


ative agreement, 
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| underlying 

Note finally that the model u of Roy 
Figures 1 and 2 has, in the termio ny xo 
and Mitra (1956), both margins varia "ss fy 
observer can assign as many of the # n two 
any category as he wishes. pe different 
observers may have, because aie asym- 
assignment procedures, "pem. ca 
metric margins. This possible asy n ereement: 
create difficulties in interpreting Figure i 
Consider the contrived example in ne REINES is 

A single question with two qe in OF 
presented to 100 mother-father pa : within- 
countries. Which country has m "omi the 
family agreement? In Country r agree: 
pairs agree, while in Country 2 yo vn 
Yet, given the constraints of d E: le 
Country 1’s pairs display the umen i lay 
agreement, while Country 2's pa roblem 9 
maximum possible agreement. E Los agree 
disentangling absolute versus re dman e. 
ment, noted, for example, by Go od foL 
Kruskal (1954), suggests the ne 
measure conditional on the margins. 


veen TWO 
Measuring Level of Agreement betwee! 
Observers 


i; n done 
Historically much work has bee! 


« or TÈ 
developing measures of agreements, ^ (951) 
bility, for continuous data. 1058); any 
Robinson (1957), Haggard m pave E. 
Burdock, Fleiss, and Hardesty (19 ied du 
considered this case. For ran* ir 


n 

Kendall’s W (1962) is well m a weve 

quently used. For qualitative data euge tiori 

only in the last few years have some illustra" of 

emerged, Because of the problem N ptio” 

in Figure 3) of using the raw proP al! 
c 


pif 
"E 
agreement EET these measure "n 
Conditional in some way on the NT 
margins. sure C 
Cartwright (1956) proposed a meas e agre 
alpha, based upon the number of Sr nse? ape 
ing versus disagreeing pairs of Te propaga 
the required assumption of uin m yd 
categories” restricts its use. Ano arginal t 
of agreement, sensitive to the m: me of! 
which would “correct for chance bap 08 
was developed by Scott (1955), w 


Po Pe, 
ae 


Lf 
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Where p, is the observed proportion of agreeing 
r C 

Pairs, and p, is X pi, where px is the proba- 
iei 


bility of an entry in cell i, i on the main 
diagonal of a C X C table. The ps were com- 
Puted, however, on the basis of known popula- 
lion marginal probabilities, and the assumption 
that both observers had the identical marginal 
distributions of proportions, equal to the 
known population values. Cohen (1960) agreed 
that 7 was conceptually useful, but noted that 
actual data often did not meet the assumption 


of symmetric marginals. He therefore 
Suggested: 
Po— be [2] 
Ko Vi 
1— b. 
1€ e 
Where 5,— — X nu and pe= 75 E nones 
"i-i M isl 


k thiis has p, based upon the observed margins, 

a Tequires no assumption of identical 
marginals, 

Note that the « statistic in Equation 2 takes 


the d 

he general form 1— 2 where d, represents 

ob "e P 

ed proportion disagreement, and d, ex- 
Cted proportion disagreement, under the 

to bothesis of random agreement. Using this 
mulation, « can be expressed as: 


[^i 
» "ua 
Esp ow "UE n 7 
n [3] 


Th ; 
Us X becomes a ratio of measures of distance, 


whero ereements, between the two apriva 
Series distances are measured hy GAUGE D 
Sures of ones and zeros. These aee 5 = 
bite simply a function of the numbers : 

MY versus disagreeing pairs 1n the 2n tota 


"sponses, 

leat series of further studies, Cohen (1968), 
(toce tt (1968), and Fleiss, Cohen, and Everitt 
Der 2 Presented a weighted version of x, which 
of g. tS differential weighting of different types 
lay agreements between observers, aS well as 


Sample standard errors for x and weighted 


jt scs d$) ads 
Present, the simple statistic j^ iis 
Which " j ni=l 
Varig; Bhores the marginals entirely, * and a 
‘on of Cochran’s Q statistic proposed by 


c 
E niti 


k, 


Fleiss (1965) designed for dichotomous re- 
sponses are the most frequently used agree- 
ment statistics. Because of its attractiveness as 
a simple distance measure of agreement, the x 
statistic is extended in the next three sections 
to measures of conditional agreement and 
agreement among more than two observers. 


A Conditional Measure of Agreement Level— 
Two Observers 


Just as when measuring association in con- 
tingency tables it is often useful to measure 
conditional association, similarly, when mea- 
suring agreement, being able to partition an 
overall x into a set of partial x’s, Kp, i =1,...,C 
can contribute additional insight to a data 
analysis. If the data consist of two observers 
categorizing items among C categories, then 
we might ask about the agreement between the 
two observers for only those items which one 
observer placed into the ith specific category. Or, 
if the data were drawn from n pairs of ob- 
servers, such as # mother-father pairs where 
each of the 2# persons assigns a single item to 


of C categories, we might ask about the 


one a 
amilies where the 


agreement in only those f 
mother selected the ith category. 
In some unpublished work, Coleman? has 
used the idea of summing agreeing and dis- 
airs to develop a conditional version 
of x. Taking a C X C table, a measure of agree- 
ment between the two observers for those 
items which the first observer (appearing on 
the rows of the table) assigned to the ith 


specific category is: 


agreeing p: 


A similar conditional measure for the second 
observer (on the columns) could be obtained 
by merely transposing the columns and rows 
of a table, and applying Equation 4. Alterna- 
tively, i+ and n+; in Equation 4 could be 
interchanged. 

x illustration of xp; consider a study of 
915 father-son pairs. Each of the fathers and 
each of the sons was asked whether the son or 
the father was more likely to initiate conversa- 
feasures of concordance or con- 
ers of social groups. Unpublished 
University, 1960. 


2 Coleman, J- S. M 
sensus between members 
manuscript, Johns Hopkins 
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FATHER'S REPORT 


Father Son 
more more 
Father more| 160 65 225 
Son’s Report 
Son more 348 342 690 
508 407 915 


Fic. 4. Report of who initiates conversation. 


tions at the dinner table. The results are given 
in Figure 4. An overall measure of father- 
agreement given by « is: 


i mi) 
— 915 T 
i (225) (508) + (407) (690) ) rox 
E (915)? 
Now we focus on specific families in which the 


Son gave certain responses. For families in 


which the son reported that the father initiates 
More, the agreement is: 


son 


k= i= 


The correspondin 
which the son re 
more is: 


E measure for families in 
ported that the son initiates 


Finding a positive x, and 
Scribes the dat 
to report th 


a negative kp, de- 
4 very well, Overall, sons tend 


at sons initiate, while fathers pre- 
dominantly report that fathers initiate. We 
would expect, therefore, that in families where 
the son reports the father to be the prime 
Initiator, there d to be son-father 
here the son reported 
iator, fathers would be 
l - This is exactly what the 
Kp; Measures indicated, 

To evaluate the 


Fleiss et al, 
arge sample variance 
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B arag b 
of x, a computationally simple large 54 


approximation would be: 


hio, qu 
Var(kp;) = n(1— po? 
where m "E. 
= > anc e ur 
bem ad po, 
Pe — ndë 
which becomes Varo(«p,) = n cp 
sement. Fog 
the null hypothesis of random ager while 
the data in Figure 4, Varii) =: using th 
Vara(kp,) = .001. We thus conclud a that bol? 
large sample normal approximatior ? at] beyo” 


1 soni is w 
Kp and xp, are different from zero, 
the .01 level of significance. 
g M 
Amon, 
Measuring Level of Agreement AN 
Than Two Observers 


can 
ont 
EI zreeme e 
The x approach to measuring ag tion Whe! 


be extended further to the situa o server? 
more than two observer: (say, i zorie Th f 
each assign » items among C ge idea a 
extension follows directly from “eii of M 
taking pairs of observers? pr Ad a seg 
item, and seeing whether both obse to differ 
the item to the same category ; the multiP f 
categories. The general form of g's form 
agreement statistic is identical to ion 9 

d, i —— propor a 
1 — 7 where d, is the observe mui 

d, d prop? jal? 
disagreements and d, the epek NE 5 oat 
of disagreements given the m o case Y a 
ginals. Consider for example se agree? 
m = 3. Then the multiple observer * 
Statistic x, can be written as: 


yet 


2} 3 ja dH 
A= È nyg Egan E Myj 
ij iz jet 


1 
> yet 
i E ona ; 
Regd & Wey 4 yj ru 
F Z ni E nigan s x y 
ET dej jr -€— "a 
Nit 
Hyg E nualtait A 
+E ngyn E nie ial 
iu ixj 
and m? 
X ong 
B-32Zniqm,, EX nian PH 
izj il 
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DAUGHTERS 
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three observers. 


1 
FATHERS 
g 
MOTHERS 
Fic. 5. Notation for measuring agreement among 


Wher 

oe o represents the number of items 
Categor ME first observer assigns to the ith 
jth Ks the second observer assigns to the 
to th ategory, and the third observer assigns 

i € lth category. 

Km can estimate either the joint agreement 
ie observers categorizing # items, or the 
Dies (eo, among # m-tuplets of observer 
fategori t! n father, mother, daughter triplets) 

A izing a single item. 
hp, PEDES large sample estimate of the 

ce of xm can be found from: 


Lb — po) y 
Var(km) S wo ray I2] 


wh j i 
lere Qo a and pe=1— 5 with A 
n° n 


quation 6. For testing the 


nd Fs 
B defined in E : 
= 0, of no joint agree- 


nul 
M i hypothesis that Kn 
» Putting p, = pe yields: 


"mum 
3 Varole) = a(t — pe)’ [8] 
Po ie is the same as in Equation te 
n om "ustrate the computation Of Km, consider 
a ilies, each consisting of a mother, father, 
“aughter, with each member selecting one 


of two categories, 1 or 2. The notation can be 
viewed in Figure 5. The 12 responses in the 
table are marginal totals. 

Figure 6 presents hypothetical data for this 
example. Note that the within-cell data that 
result in the marginals of Figure 6 consist of 
the following eight responses: 


mu = 5 
mi = 5 
113) = 40 
1323 = 40 


Computing first Km, from Equation 6: 


A = (80)(90- 50 + 10-50) (10-50 + 90-50) 
+ (0) (90-90 + 10- 10) (10-50 + 90-50) 


-- (50) (90-90 + 10- 10)(90-50 + 10-50) 
= 6.1 X 10° 


B= 3(8200) (5000) (5000) = 6.15 X 10". 


So, 
nA 


km=l—- p= B 0i. 


t= 1004 = 


To find Varo(km) from Equation 8: 
B 615% 10" g 
615. 


(1—29 = 55 = 10,0 x 101 ^ 


Fic. 6. Set of marginal totals. 
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Hence, 


Bode! ois. ~ 
Vato(km) = 100(.615) ^ 0063, 


which yields an estimated standard error of 
Solkm) = .08, 

From the large s 

Km/So(km) = 312; 


Conditional Ag 


reemen|— 
Observers 


More Than Tw 


10 

Just as the conditional measure «p; could be 
defined for two observers, the concept can be 
extended for m > 2. Once again this extension 
proceeds by summing pairs of agreements and 
disagreements, 

For m=3, a measure of 
agreement for those families 
observer" assigned an ite 
(any observer type, 
designated as the 


within-family 
whose “first 
m to the ith Category 
€g, father, can be 
"first observer”) js: 


n(Zn; Zn, |i -En Hin) izj 
j L i 1 
Kmp = 1 — ^1 


2nj, 245, Xn 
i 1 


Hi? il. 
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Kn pt 

©... -0 of Km? 
p ' ce 

The large sample approximate Ms 

can be estimated once again from: 


pap), ^ pU 
n — pe)? 


Var(kmp,) & 
where now, 


/ 
DSTI) 
(1—p,) = (Ennyi Tx "v 

4 g 


j Zk 
"x 
(Qna)! a 
and 
2 " ) 
— p) = 5 (n, nan 
(1— p.) sje ui MT 


e ge 
To test the hypothesis that rmp: p : jog 
8 with p, from Equation 10 prov ye 
Sample estimated variance. . f mp? adl 
To illustrate the computation P i ond 
return to the data of Figure 6. The which b. | 
agreement for those families in " jg fo | 
athers selected the first response 
from Equation 9 to be: 


4 _ 100£(80)(50) + (0037 


2(90)(90)(50) 


Kmp = 
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Observer 1 


maeO | ays) |m 


naD | na | roe 


1) an 


(ey : 
Orrect assignment 2 | ja? 


3 us 


nasO | nas pans. 


na ns onus |n 


Fic. 7. Data format for comparing joint agreement of se 


n computations for the five other Kmp: 

of Fi € following set of results from the data 

each th 6; there are six partial measures, as 

aa the three family member types chose 
Wo Categories. 


Parti 
rlial Agreements 


Fathers Mothers Daughters 
mm = O1 Kmm= S50 m= — 30 
Ag, SO tay 0 tay m — 0 


p" thus see that while for daughters 
M ccs b for both mothers and fathers 
gives Km 23 F'or computation, since Equation 9 
eg Fe in terms of a fixed row selection 
Tha" athers who choose row one), to find 
Probab mothers (and then daughters) it is 
the ^ y simplest to interchange subscripts of 
au s in order to view mothers (and then 
thes ters) as being indexed on the rows with 
Subscript i, 


(o TA 
ind the Joint Agreement of Several 
"ers with a Standard 


anv’ now focus on a slightly different problem. 
two à the K-type measures of agreement have 
loj, non properties. First, et ae 
Of i aoe about a “true, or correct assignment 
Obsc. 1$ to categories. Responses of two or more 
‘Sta, Vers are not measured relative to any 
they dard," but rather only with respect to 
of ie consistency. Second, in the case 
S rers n two observers, pes p 
mn, nterpy, ep equal weight. Thus, £m a 
sure. eted as an “overall group agreem 
n 
‘Wo RUN Situations, however, neither of these 
Drop, op erties will be appropriate. For some 
Porn s it may be reasonable to assume à 


l sen H ic 
Set” of responses, to which we wish to 
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Observer 2 


m2 [ons | my 


ny? 


Correct assignment 2 
3 


na (n4 O 1243 |n 


veral observers with a standard. 


compare the responses of one or more observers. 
For example, when a medical doctor or 
psychologist believes that he has correctly 
classified » patients into C symptom categories, 
he may wish to compare the categorizations of 
his students, taken as a class, with his “correct” 
assignments. In other situations, even if we 
are not willing to assume a set of “correct” 
assignments, we may be interested in isolating 
one member of a group as being of special 
interest, and comparing the responses of the 
“rest of the group” to those of the isolated 
member. Thus, even if we do not assume the 
isolated member’s categorizations to be “cor- 
rect” in an absolute sense, we still use this 
member's responses as a standard against 
which to compare the rest of the group. For 
these situations, xm would be inappropriate, as 
it does not focus on any single group member. 

Guetzkow (1950) has suggested several 


measures to evaluate agreement among ob- 
when each item has 
He found that 
hen more than 
ed is primarily 
determined by the probability of all observers 
selecting the “correct” category. Tukey (1950) 
has pointed out, however, that these procedures 
assume a “response is either correctly classified 
h one hundred per cent certainty, or it is 

51]" which is an 
erefore consider an 


servers categorizing items, 
a specified “correct” category. 
agreement among observers w. 
four or five categories are involv 


wit 
classified at random [p. 
uncommon situation. We th 
alternative procedure. 
Assume that a “correctly classified" set of 
n items, in C categories, is independently 
categorized by each of m observers. When 
1, then x serves as a measure of agreement. 
nm > 1, then km measures the “joint 
agreement” between the (m+1) sets of 
categorizations, which treats the correct 
allocation as “just another observer’s alloca- 


m= 
But whe 
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Colleague 1 


Normal Neurotic Psychotic 
Normal 12 6 2 20 
Psychologist Neurotic 4 10 16 30 
Psychotic 24 4 22 50 
40 20 40 100 

Colleague 2 

Normal Neurotic Psychotic 
Normal 10 3 7 20 
Psychologist Neurotic 14 15 1 30 
Psychotic 6 22 22 50 
30 40 30 100 


Fic. 8. Psychologist’s categorizations versus 
tion.” We thus need a tes 
ment of the m o 
allocation. 


Let s; represent the number of responses 
in the i,j cell of the pth table; ij =1,...,C; 
p=1,...,m. For example, suppose m = 2 and 
C = 3; then the layout in Figure 7 represents 
the data. Note that for these two tables in 
Figure 7, and in general for all m tables, the 
Tow margins 7;; must be identical for all tables, 
but the column margins 71,5 will often differ. 

To test the hypothesis of “random agree- 
ment" of the group of m observers relative to 


the "correct? categorization, the statistic G 
can be used: 


t for the joint agree- 
bservers with the “correct” 


Nm; E(tm) 


as ? izations. 
each of two junior colleagues’ categorizati 


where 
T pus 
lm= È Eny 
p=l i=j 


c m 
E(t,) = i D [n4(X 143) J, 
Hiini = 


i=j p=1 
and 


g m 
Hm =—* E mald ny] 


p=1 


n 5 , 
i E [rey E n” J 


= n(n — 1) i=; pci 


«p))?) pud 


1 E Su 
MN 11 ——— M2 sam (aj 
Gaye 79D [At] Ht a es 
a. Observer 1 b. Observer 1 
1 2 3 1 2 3 
6 |10 
1| 4 2 4 |10 P 9 dl er hi 
Observer 2 2 8 & 8 |20 Observer 2 2 8 6 | ee " 
3| 8 4 8 |20 3| 10 2 8 E 
— = — 2o | 80 
20 |3 
?0 — 10 20 |50 20 — 10 x 
k-0 xm 


Fic. 9. Responses illustrating how patterns of 


scale 
identic 
2 : AP ent are ide! 
agreement can differ while levels of agreement 


TM 
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where in Equation 11c above: 


Z = nyin Qui) (ya) 
p 


F nog (141) iua + + 
P 
+ nes D (na0)uo0)] 
p 
Tons È (n4) ta) 
p 


+ fis E (nyo )(n p) Tee 
+ ncs (nyo) (n4o™)] 


p 
+ neun E (nye) (t4) 
$ 
Fna (nac) Gp) H 
? 
+ ne E (nyo) 0e )] 
? 


S Compares the observed agreements of each 


” sion- 
Men, "t respondents with the Bonos. pat 
ent, while recomputing the expected nu 


agree: based upon 

pE Cements for each respondent nic, 
Margin assignments MPs: 

bs Si assign d from the well- 


k >+.. m. G is developed tr i^ 
Dow results that for any contingency 
en fixed marginals: 


able, 


nita 
En) =~ 
ing ——- 
diis D 
Vi nata — Diest 
rii n(n — 1) 
» 1 the same row or 


E for any two cells not 1r 
Sa 
Me column: 


nighyr Msi j g, GAN. 


Co 
Nus ny) — Ca n— 
» entries along 
JD TS m of the en : 
th he variance of the sul a single observer, 


RS aL p al, for 4 
ig; "Sreement diagonal, 


c 2 $ Cov (itishtao)- 
E Var(u) + deo 


i=j 
homing these single Cer with the pairwise 
x. % observers, inc tables, yields the total 
Va, Ariances over ta SUR G will be approxi- 
Nance p, For ee qributed around Z(,) 
Ately normally diS 


: observer variances over 
t 
IN 
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under the null 
agreement.” 

As an example of the use of G, suppose a 
psychologist classifies each of 100 subjects into 
one of three classes; normal, neurotic, or 
psychotic. Suppose further that each of two 
junior colleagues independently categorizes 
the same subjects, and we wish to see whether 
the joint assignments made by the two 
colleagues “agree beyond chance” with the 
ratings of the psychologist. The data appear 
in Figure 8. Computing fm, E(/,.), and čin from 
the formulas in Equation 11: From Equation 
11a, im = 91; from Equation 11b, E(¢n) = 67; 
from_Equation 11c, 


hypothesis of “random 


1 
p ag (6,700) + 5 (22,450,000) 


Stm 990,000 
DT 1 ] W- 
= 9,900 257,000) = 9,909 235,000) = 40.7. 


A test of the hypothesis of random agreement 
is given by: 

91— 67 24.0 

G= anni aa = 

(40.7) 
Whether to use a one-sided or two-sided test in 
this problem is up to a particular researcher, 
but in either case, the value G — 3.75 is 
significant beyond the .01 level, and we may 
conclude that the joint categorizations of the 
two junior colleagues agreed with the psy- 
chologist's categorization significantly more 
than we would have expected due to chance. 


Distinguishing Patlern from Level of Agreement 
between Two Observers 


We return finally to the case of m = 2 ob- 
servers. The statistic « gives a measure of how 
the observed overall level of agreement compares 
with the expected overall level. It is completely 
insensitive to differences between the observed 
and expected patterns of agreement. For 
example, both of the 3 X 3 tables in Figure 9 
have x — 0, and identical margins, However, 
while the observed levels of agreement displayed 
by both Figure 9a and Figure 9b are identical 
to those expected by chance, the patlerns of 
agreement, which contribute to the overall 
level, are different for the two tables. Figure 9a 
has the agreements distributed exactly as ex- 
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a. Observer 1 


1 
Observer 2 2 


Fic. 10. Data illustrating why standard chi-squ 


pected due to chance, on the agreement 
diagonal. Figure 9b has the agreements dis- 
tributed differently from expectation. Yet x for 
both figures is zero. 

An agreement measure that is sensitive to 
deviations between obser 
counts in particular cells 
Cohen (1968) noted thi 


out that the standard chi-square approach to 
measuring association has sometimes been used 
to measure agreement, although it is inappro- 
priate because of its sensitivity to discrepancies 
between observed and expected cell counts in 
disagreement (off-diagonal) cells of the table. 
For example, in Figure 10, both Figure 10a and 
Figure 10b have the identical levels and pat- 
terns of agreement, yet in Figure 10 
shows exactly “chance” agreement, while in 
Figure 10b chi-square registers a large dis- 
crepancy between expected and observed cell 
counts. The problem with using the standard 
chi-square, then, is its sensitivity to discrep- 
ancies between ol 


bserved and expected counts 
In every cell of a table. To examine the pattern 


of agreement between two observers, we need 


à statistic sensitive only to discrepancies on the 
agreement diagonal, 


ved and expected 
would thus be useful. 
s need when pointing 


a chi-square 


A modified chi-square approach cnables us 
to examine pattern agreement. Lumping 
together all the off-diagonal cells, so that 
discrepancies between observed and expected 


cell counts will not affect our measure, leads to 
the statistic: 


b. Observer 1 


10 
ib] os s| 1 
uS 
Observer 2 2| 5 | || E 20 
ian ae 8 
3| n | 1 Br 
a o wia 
x2 = 12.38 bn 
ing agreemen ^ 1 
are statistic is inappropriate for measuring 28 J is 
ent ^ 
This measure of pattern "e focust il 
merely a variant of chi-square h and poo! 
the agreement cells individually, "When d 
disagreement cells into a set. lev 


ff " he 1€ 5 
together with x, which estimates 1 gure 
observer agreement, these CHE oÉ het 
should offer a more complete es i a bre 
relationship between two absene Aer ran All 
measure could give individually. symptoti jo! 
agreement, A, is distributed t + eed os 
as chi-square with C degrees 0 in 
aC XC table. 


, data 

the ~ all 

To illustrate 4,, return ta seach e all 
Figure 9. In Figure 9a, Ap = 0, 28 


^ total © nd 
the agreement diagonal and We. bser ed 
nonagreement cells have identica! ob 
tas TP ] 
expected cell counts. For Figure 


Q—4*  (6—4* 
EE ALLEN 


A, 


pe? 

egre 
h 3 des a 
bsc ac 


and a chi-square value of 2.0 wit the 9 
freedom does not indicate that the € 
agreement pattern differs bo us, for PT 
pattern at, say, the .05 level. 'Ph en 
9b, we cannot conclude from he p? mo 
Ap = 2.0 that either the level or eue ed Je 
agreement is different from the € ; 
and pattern. be zer? M 
Note the possibility that Kan e nt; h 
A pis very large and highly significa zero 
if A, is zero, x is constrained to be 7 


a 


» te 
i-square 
out that for the standard cage BY 
contingency table, the presence of ME cos 
(n5) is very small, such as 1 or 2, may 
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impair 
the chi ai ; : : " Š 
à he chi-square statistic’s behavior as an as chi-square with C degrees of freedom, is 


Pproximati : a € 
Er nation of its asymptotic reference dis- derived from the general model of the (C 4- 1)- 
n. Ap, while asymptotically distributed nomial. That is, while A p is computed from the 


| TABLE 1 
ES OF PATTERN AGREEMENT STATISTIC Ap 


. Monte CARLO STUDI 


Second marginal Mese First marginal 
CON variance x a 
— 2 y A. 3 X 3 Contingency tables N ; 
3 40 .20 .20 80 
33 40 30 .20 10 
En 20 50 .60 10 
n= Ls e um am m dee) 
E O 50 200 50 200 50 200 50 200 E 50 200 
33 — | agr 2 | 2:69) 20 302 296 | 3.02 301 | 3.01 3.01 
— C 33 34 Var 625 582 | 647 5.95 618 611 | 6.38 6.31 746 6.31 
* E 3.08 2.97 3.06 2.97 | 2.98 2.95 | 2.94 3.06 
) 1 5 i 
E A0 .20 ar 6.29 6.00 6.73 5.99 | 6.13 6.10 | 6.80 6.25 
EM. je 
à —— TE 3.00 3.04 |Y 297 3.00 | 3.001 3.03 
B a 20 ] 6.37 625 |. 6.59 6.09 | 16.61 649 
— B aa ^ z E 
Serre * 3.08 2.99 | 3.04 3.06 
E E m" ve 6.718 6.30 7.23 6.30 
aM 2.06 2.98 
d 80 38 38 T 949 7.03 
—— RENE z| * 
B.4xX4 Contingency tables 
oe - 
35 30 10 10 .70 
25 30 10 20 A0 
25 20 440 .30 10 
25 .20 AO 40 .10 
F d n= rm a= n= 
so 200 50 200 50 200 50 200 50 200 
p 
a LL 1 | 405 401 3.99 40 
i a 391 40 1 1 à OL 4.00 4.04 3.99 3.99 
E 5 df | Sum | 910 sis | 909 855 | Bre Bob 91i Bis 
"reo 
a 399 401 404 401 | 4.01 3.98 | 403 3.97 
80 ay 20 20 Var 8.77 8.01 | 10.07 8.79 | 886 821 9.99 818 
fee ; 
L 
m 3.97 3.92 | 3.95 400 |* 4.0 
" = 01 3.96 
4 
Ü du dt EA n "M 10.76 848 | 10.05 838 | 10.59 gat 
M 400 | P 
4.01 40 
T-— oe 02 4.08 
0 4 | 4979 902 | 10.33 8.70 
M | m 
ar | 401 3.97 


49 ao 0 «10 
x. = l | 13.86 8.57 


ES. 
— ihe mean and variance of A, for various combinations —————— 
Ang Note,—The table siv 3l nypothesis. p for various combinations of marginal probability vectors, f 
ie s, for n = 80 


n = 200 under t 
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contingency table format, it is simply 
tion of the individual cell probabilities 
C diagonal cells, 

We therefore present. the results of some 
sampling experiments, which enable us tostudy 
how well 4, is referenced by chi-square (C) 
when (a) one or both observers assign items to 
categories with very unevenly distributed mar- 
gins, such as with probabilities -80, .10, .10 in 
à three-category assignment; (b) the proba- 
bility structure of the two margins are substan- 
tially different, due to differences in overall 
rating strategies between the 
(c) because of the marginal pr 
tures, one or more K 
diagonal is very small. 

Table 1 gives the results of 
experiments; Table 1A for 3 x 
Table 1B for 4 X 4’s, In these ex 
probability structures of both r 
Specified, together with the 
items » to be 
Five marginal I 


a func- 
on the 


two observers; 
obability struc- 
nij) on the agreement 


the sampling 
3 tables, and 
periments, the 
marginals were 
total number of 
Categorized by each observer. 
probability structures were ex- 
amined for each observer, which when com- 
bined in pairs and with themselves gave a total 
of 15 possible contingency tables, Fifteen 
3 X 3 tables and fifteen 4 X 4 tables were thus 
examined. For each of these 30 t 
were generated with jj = 50 observations, to 
Tepresent a small total sample, and x = 200 
observations, to represent a moderately large 
sample. Thus, 60 different experiments were 
run, 

For each experiment, 5,0( 
generated, 4 


ables, samples 


J0 samples were 
» Was computed for each sample, 
and the mean and variance from each set of 
5,000 samples are given in Table 1A and 
Table 1B. In Table 1A, the expected mean and 
variance for A, under the asymptotic reference 
distribution are 3.00 and 6.00, respectively; 
in Table 1B, these are 4.00 and 8.00. Examining 
the results of these experiments helps us to 
analyze the behavior of Ap under the three 


Conditions posed above. 


Results 


First, regardless of 


the probability structure 
of the table, 4, h 


as its observed variance closer 
to the expected values when » = 200 than 
when x = 50. The mean always fits well. This 
was true for all 30 table structures. Thus, the 
larger the total number of items to be cate- 


gorized, the better A, will be approxima 
chi-square (C). sp results i 

Second, for 3 X3 tables, fici d by che 
ly being quite well ay apr robability 
Square's first two moments for a P observers 
structures except when one or bot oa single 
concentrate most of the items vete vark 
category. When this happens, pe ‘timated y 
ance of 4, is substantially underes both ob: 
chi-square (C). For example, ipae ne 
servers tend to assign 80% of the Á other iV 
category, and 10% to cach of the s0 15 


s : or H — an 
categories, the variance of pe rather tha 
found to be approximately 9.49 ri 


the expected 6.00. = 50 om 

Third, for 4 X 4 tables, when ^ imate D 
Square (4) will generally undet es both ? 
actual variance of 4,. ily Wü, appr’; 
Servers assign items to categories P bility o m 
mately equal probabilities (a proba Mec is t 3 
Íor each category, for both obser A M) ap 
experimentally observed bue 00. i 
proximately equal to the expected £ yielded 

Fourth, for all 30 tables, 7 = 2 i ls close i 
mean and variance for 4 p reasona mar A 
their expected values, Thus, for mos a 2 
probability Structures, a sample size "nis ud i 
for 3X 3 or 4 X 4 tables indicate S puti 
(C) will be a reasonable reference pe cells Jp 

Fifth, for table structures that x nij bs 
the agreement diagonal with low hat lag f 
has an experimental variance somew ót n 
than the expected variance. ] da ea ch ot 
given z, the larger the gidhaiuu ihe et 
the cells on the agreement diagonal, 


i Ww Ap 

the referencing of 4 » by curaque ir 3 
Finally, because the actual TE t nO 

generally exceeds the expected vat ni 


exam’, g 
chi-square, the use of chi-square to M! n 
table for pattern agreement will aa 
conservative test. Researchers may S ob 
this knowledge when interpreting P 
levels. 


abi < 
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MEASURING 


NOMINAL SCALE AGR EEMENT 


AMONG MANY RATERS! 


JOSEPH 


Biometrics Research, 


Cohen introduced the 
(Cohen, 1960) and weighted kappa (Cohen, 
1968) to measure the degree of agreement be- 
tween two raters who rate cach of a sample of 
subjects on a nominal scale. Both kappa and 
weighted kappa incorporate a correction for 
the extent of agreement expected by chance 
alone. Kappa is useful when all disagreements 
may be considered equally serious, and 
weighted kappa is useful when the relative 
seriousness of the different kinds of disagree- 
ment can be specified, Properties of these two 
Statistics have been studied by Everitt (1968) 
and by Fleiss, Cohen, and Everitt (1969), 

The use of kappa and weighted kappa is 
restricted to the case both where the number 
of raters is two and where the same two raters 
Tate each subject, Generalizations are therefore 
necessary for the case of more than two raters 
and for the case where the raters judging one 
subject are not necessarily the same as those 
judging another. 

Tn this paper we consider only 
tion of unweighted kappa to th 
of agreement among 
raters where there is 
the raters judging th 
important s 
rated by 
consider t 


statistics kappa 


the generaliza- 
€ measurement 
any constant number of 
no connection between 
€ various subjects. An 
pecial case is when subjects are 
different pairs of raters. We do not 
he case of varying numbers of raters, 
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National Institute of Mental Health. Myron G. 
Sandifer, Jr., Associate Dean for Academic Affairs of 
the University of Kentucky College of Medicine, kindly 
provided the data on which Table 1 is based. . 

* Also at Columbia University. Requests for reprints 


should be sent to Joseph L, Fleiss, Biometrics Research, 
State of New York De of Mental Hygiene, 


‘partment 
722 West 168 Street, New York, N. Y. 10032. 
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agreement between 
case where each of a 


: od by 
or the case where each subject is ane ot 
Same group of more than two ra zd 
case where weights can be assign 
various disagreements. 


NOTATION 


- of subje 
Let N represent the total number ars dk 
n the number of ratings per subject, 5. mé 
number of categories into which t BE 
are made. Let the subscript i, wher es 
N, represent the subjects, and the at 
J; where j= 15... k, represent the 
of the scale, aters 
Define ni; to be the number of p cate 
assigned the ith subject to the jth 
and define 


= 3d B 
‘ip 

subse” 

su orie 


gor)? 


. ow 
Z nij. 
i=l 


Pi Nn 
, | as 
The quantity 5; is the proportion dius 
ments which were to the jth catego™ 
jni; = n, therefore X; p; = 1. ET. k 
Table 1 presents data on N patie 
diagnoses were given each of N = < E. v 
by » — 6 psychiatrists, Thus, Pind 124 ^y, 
mj = 0 for j = 4; na = 3, na = ie pro 6s 
for j æ 2 and j = 5; etc. The n = 
tions are p= 144, po = pe of 43 
Ps = .306, and ps .239. A tota m 
chiatrists provided diagnoses. In - 
Study (Sandifer, Hordern, Timbury, from © a 
1968), between 6 and 10 psychiatrist: a6 E 
pool of 43 were unsystematically 8e p urp 
diagnose a subject, For the "nro Ne wed 
of this paper, randomly selected pp 1 
dropped to bring the number of deem 
per patient down to a constant of six. 
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MEASURING 

Th EASURING OVERALL AGREEMENT 
€ exte á 

for tlie 3 nt of agreement among the x raters 
A subject may be indexed by the pro- 


Porti 

On of agree; Y 

Possible pum MB out of all the x (z — 1) 
^s, Say, airs of assignments. This proportion 

ES. 1 k 


E hag 3 Maly — 1) 
i= 


1 k 
x n(n — 1) um nyc n 


[2] 


> 
ten, Pis l; Py= 40; etc. The overall 


Agreement may then be measured 


e Meg 
in of the Pis, say, 


B_ly 


WEP 
is, 


x! - m 
Vana — 1) [B mage 


= [3] 


u i=1 jel 
he d; P 

| * data of Table 1, 

| Aer 5556 

COTON — 30 X 6) = -55° 

Th. 
les Value P = ,5556 may be interpreted as 
“nq Ws. Let a subject be selected at random 
hi, enosed by a randomly selected psy- 
V go. St TE the subject wer 
Seq, Cond 


gnosed by 
d psychiatrist, the 
with the first 
like P have 
by agreement 
San qon ifer, Pettus, kE * 
ing difer, Hordern, Timbury, and Green i, 

Y Sandifer, Fleiss, and Green (1968)- 
that some degree of 
d solely on the basis 
aters made their 


e also dia 


randomly selecte 
diagnosis would agree 
6 of the time. Measures 
Used to measure diagnostic 
and Quade 


c 
ten 


1S clear, however, 
Ment is to be expecte 
if the r 
at random, one would 
agreement to be 


A 
Ce 
oS 


Che 
Age ance. In fact, 
k 


1 
aV, BIments purely 


| Vect the mean proportion pf 


pP- [5] 


hy 
Dr the data of Table 1, 


tm qage p 144? 2 
144? + uan -239° = .2201. [6] 


I 
6 
no} 


NOMINAL SCALE AGREEMENT 


TABLE 1 
DiaGxosEs ON 30 SUBJECTS BY SIX RATERS 
PER SUBJECT 


Category 


Subject Schizo- | Neurosis | Other 
| pirea | G=) | G=5 
i | 
1 | | | 6 
2 | | E | a 
3 1 4 1 
3 6 
Š 3 4 
6 2 4 
7 4 2 
8 2 3 
o. | 9 ý 
10 E 
"il 1 | 5 
12 1 1 4 
13 | 3 3 
14 1 5 
15 | 2 2 i 
16 5 1 
17 3 1 2 
18 5 
19 2 4 
20 1 2 3 
21 6 
22 1 3 
23 2 1 3 
24 g 4 
25 1 ^ 1 
26 5 1 
27 4 
28 2 4 » 
29 1 5 
30 6 
Total | 26 26 30 55 43 
bi 144 144 A067 306 | .230 


The quantity 1 — P, measures the degree 
of agreement attainable over and above what 
would be predicted by chance. The degree of 
agreement actually attained in excess of chanc ` 
is P — P,, so that a normalized measure p 
overall agreement, corrected for the amo - 
expected by chance, is piso 


[7] 
For the data of Table 1, 


. 5556 — 2201 


odes S07 74390. 


[8] 
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An equivalent expression for x is 


$ 
ni? — Nall + (n — D Z p] 
m : E 
-Yn(n — 1)(1— X 


m 


bj) [9] 


STANDARD ERROR OF x 


It seems reasonable to interpret the 
of real agreement among the z raters 
inability to distinguish one subject f 
other. Such an inability implies th 
apply the overall rates of 
to each and every subject. This in turn implies 
that under the hypothesis of no agreement, the 
n assignments on each subject are multinomial 
variables with probabilities [E Pes 

If V, the number of subjects, is large, we 
may assume that the proportions Pry ceog Di 
given by Equation 1 are constants so that by 
Equation 9, « is essentially a function only of 
the random variable Xi djn. We assume 
further that even though pi, ..., pr are held 
constant, .\ is large enough so that the sum of 
the squared numbers of assignments for the 
ith subject, Zjn;j, is independent of that for 
the i'th subject, yn. Thus, if N is large, the 
variance of x under the hypothesis of no 
ment beyond chance is 


absence 
as their 
rom an- 
at the raters 
assignment, (5), 


agree- 
approximately equal to 


N k 
2 Var( 2 n) 
Varo) = — = da - [10] 
NVn?(n — 1)201 — E p 


y 


= E(Xnj)-[E(x nag?) 


Il 


FE(Zni*')d-E(XE nini) 
J itm 


— [ee = me). [11] 


The required moments are found to be 


(ZV ny) =n + n(n — 1) 
j 
X[7Z p?+6(n —2)x p? 


+ (n = 2)(n — 3)> pr, [12] 


ECEE ni Phim) = n(n — 1) 
p 


+ n(n — 1E(Qn — 5) pt 


3 


—2)Zfi 
+ (n — 2)(n — 3)(2 pit — 2H ) j 


13] 
— (n — 2)(n — 3)> pith i 
and 


CE(2 nj)? = n? + n(n — 1) 


"E W 
X [2n p + n(n — TE pi) 


Thus, 


Var( £ n5?) 
d xp» 
= 2n(n — DLE p? — Qn — (X P T 
P 3 " 
1233 pi J, 
+2(n E " 
Si 
-Varl 2j 
and, because X; Var Xing) =A Var 
therefore 
2 
Nn(n — 1) $ ; pi 
2 pF? — Qn — 3)(X pay? ES lab pee 


Var(x) = 


x a =S 42) 


i 


For the data of Table 1, we = 2; d 
found that X; p; = 2201. Furth i } varið 
= .0484 and X; p = .0529. Thus, t"! 
of x is approximately 


2 
Var(x) = (30)(6)(5) P 
.2201 — (9)(.0484) + (2)(4)(052 
: (1 — .2201)? g 
= 000759. » 
Ce T2 
of £d 


and the standard (SE) 


proximately 


error 


[18) 
SE(x) = .028. M" 


ax 
beyo s 
ment l jimit 


. agree 
Under the hypothesis of vs A 
chance, «/SE(«) will, by 


as 
E X] & 
É elv distribute 
theorem, be approximately dis 


centra . d 


ba 


NOMINAL SCALE AGREEMENT 


5 
! andar norm 


al variate. In our case, 
x _ 30 
| SE() 028 7 


[19] 


= 
Un 
He 


E. 
e A . 
(ain infer that the overall agreement 1n 
Reate nent to the & categories is significantly 
| er than chance, 


Acn EE 


ENT ON A PARTICULAR CATEGORY 


We conc: 
à ic Consider now the extent of agreement in 
hec a a subject to category j. To motivate 
4 Bie d of the basic index of agreement, we 
the p » that each subject is assigned to one of 
n uis a by a randomly selected rater. 
€ rat nose subjects assigned to category 
‘lected P again, now by another andan 
opa; rater. 'The index is the conditiona 
3 ity that the second assignment !5 to 


Cat, 
jS as to category 


Ory f gi i 
Tt k J, given that the first w 
calculated as 
N N 
- x Xa ; 
È. (p -1 m^ 
= : i 


[20] 


Se Pj is defined by Equation 1- A 
ier the hypothesis of no sere oE Ba 
e ance, we would expect this proba : ity 
Si EM the unconditional probability ak an 
of &nment to category f, namely Pi- A meas Y 
ws; € EXtent of agreement beyond chance 
Snment to category J iS then 
D> 
Db 


I$, 


Lr 


N 
` Zoup—XNap[icC-— Dp [21] 
EET Mo. = , 
Nn(n — 1) pi4i 


the meas- 


that «K, à 
9), is 


(Equat ion 
Tn fact, 


Whe 


le 6 = 1a Note 


je 

of overall agreement 

Cighted average of the zw 

x = Epit Zipli 

If M i ao we can assume that pj is a 

ale i elk with this lxepewr, that 
| n. , nt and 2 i = 

tij is ind x lent of ny; for! = i’, Under the 

je independ more than chance agreement 

in eain of E category j, the quantity ni 

assignment i i 
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TABLE 2 
STATISTICS FOR MEASURING AGREEMENT ON Eacu 
OF THE Five CATEGORIES OF TABLE 1 


Category B: m? Pi P; * Var (43) | «7/SE(«;) 
1 72 248.0130 2 
2 72 248 .0130 2.17 
3 120 517 | .0136 444 
4 229 470) .0195 3.36 
5 187 565, .0163 | 4.43 


has a binomial distribution with parameters 7 


and pj. 


Under this hypothesis, 


Var(i;j) = pigs 
x EQ 4-20: — Dp)? + 20: — Doa] [22 


"Therefore, the approximate variance of «j is 


[1 + 206 — 1) pj}? + 20 — pus 


Var(sj) = Nn(n — 1?pjq; 


The hypothesis may be tested by referring the 
quantity «j SE(«j) = Kj VVar(«)) to tables of 
the standard normal distribution. 

Table 2 gives summary statistics for each of 
the five categories. The calculations for 
Category 1 (consisting of involutional melan- 
cholia, manic-depressive depression, and psy- 
chotic depression) are illustrated in detail. 
Since Xina? = 72, and pı .144, the basic 
index is, by Equation 20, 


— (306) C144) — ,. 


(30)(6)(5)(.144) 7 "9 


indicating that, given that one assignment of a 
subject is to Category 1, the chances of a 
second one being to the same category are over 
one in threc. 

By Equation 21, 


396 — . 
= a 248. 


= 7 


P= 


[24] 


[25] 
The approximate variance of xy is 
Equation 23, 3 
Var(«1) 


_ [1+ 2V(5)144)F + (2)(5)(.144)(.856) 
(30)(6)(5)*(.144) C856) 


= .0130, 


by 


[20] 


382 JOSEPH L. FLEISS 


istics kal 
so that the approximate standard error is Everrrr, B. S. Moments of the statistics 
114. Thus, 


weighted kappa. Britis Journal of M a 
Statistical Psychology, 1968, 21, 97-103. 


le. 
«e samp 
Ki .248 E Feiss, J. L., Cong, J., & Everitt, B. a kapp*y 
SE() = re 2.17, [27] Standard errors of kappa and 27 
A 3 Psychological Bulletin 1969, 72, 323- DENEA QM 
: s » GREEN; 5 
indicating that the Agreement in assignment to SANDIFER, M. G., Preiss, J. Ly & evalu 


Category 1 is significantly better than chance 


à A *« 2n elinical drug 
Sample selection by diagnosis in clinical 
at the .05 level. 


3 8-128. 
tions. Psycho pharmacologia, 1968, 13, ne 3 
SANDIFER, M. G., HORDERN, A., € s. 
LIE GREEN, L. M. Psychiatric diagnosis: 7 
ZFEREN(CIEG > x ad 
REFERENCES study in North Carolina, London an 
COHEN, J. A coefficient of 


agreement for nomina] 
scales. Educational and s 


1 p, D. / 
‘ychological Measurement, SANDIFER, M. G., PETTUS, Ojo Quan ous 7 
. 1960, 20, 37-46. er of psychiatric diagnosis. Journal of Ne 
Conen, J. Weighted kappa: Nominal scale agreement Mental Diseases, 1964, 139, 350-356. 
with provision for scaled disagreement or partial . à : i 
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ERRATUM 


" avid 
In the article "Current Research on the Frequency of Dream Recall” by D 
Cohen in the June 1 


s a hou 
970 issue, lines 26-30 of the left column of page 437 uency 

read as follows: “Relatively high anxiety was related to significantly lower freq 

of dream recall in groups marl 


ked by intellectualization and abnegation. . . - 


ppa 9 
tical 0 


QE 
| Cd 
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 PRoPRrOCEPTION AND THE TIMI 


NG OF MOTOR RESPONSES 


| RICHARD A. SCHMIDT? 


Two hypotheses that relate moto 
are presented, along witl 
the subject uses the inflow 
of a movement sequence to cu 
evidence was found to supp? 
decay hypothesis states that tl 
back in short-term memory to cue 
recently has clear support for 
possibility that feedback is no 
sequences being programmed. 


The y; 
Practie high levels of skill displayed by well- 
th m Subjects are highly dependent upon 
Ness ms of the various movements, that is, 
ime, the submovements occur at the proper 
Noord; total response appears jerky and 
that minutes" Adams (1966) concluded 
ae was one of the most important 
avi x studied aspects of human motor 
tha Wo") and, in fact, Michon (1997) stated 
y is “the fine structure of the timing 
fios." successive elements in à string of 
qe 3 which characterizes skilled. perform- 


oF th D. 1]." As important as timing seems 
he mechanisms of 


i erformance, the 

es , iming in sk 

tq, t mention of the role of tinie A 
wh 


St. 85 late as 1949 from Helson, r 
Oy Cd that subjects in a t^ ing task with- 
‘ih 3 review were able to track 
Voy Wave target course with lags t 
th Y near zero, Helson 949) conclu 
the Subjects were learning 


H 
o] arg nd were 8 
h get course à ? were V 


of 
oF 
A 


an, 
N. Be: -ning error P 7 
Thi E that pu to anticipate i5 ex- 
Hity oss 2) d ack A. Adams, 
jg exteno Elis, Judith L. 


1 A 
| Job bP reciation spacial: 
vith M. Christina, M Imach for the helpful com- 

tg, 8nd George this paper. Requests 
gt Fs 9n an earlier C to Richard A. Schmidt, 
^ Ba, “prints should b? Diversity of Michigan, Ann 
Any Tbour Gymnasium 

9r, Michigan 48104. 


Department of Physical Education, 


r response timing to proprioceptive 


h speciñc prediction: 
of proprioceptiv 
e the timing of a later 
oint of view. 
he subject uses the decay 0 
the timing of a latter mov 


rt this pi 


this view been prov 
t used at all, with t 


University of. Michigan 


feedback 
s. The input hypothesis states that 
e feedback from an earlier portion 
portion, and considerable 
Thé Adams and Creamer 
f proprioceptive feed- 
ement, and only 
ided. Finally, there was the 
he timing of the various 


tremely important for the understanding of 
motor skills, for without the ability to antici- 
pate the arrival of future stimuli (e.g, a 
rolling ball), the subject would be forced to 
respond with drastic delays due to reaction 
time. Thus, being able to anticipate allows 
the subject to ready himself for the upcoming 
event and to respond with ‘the near zero lags 
that Helson noted. 
Timing has received most of its study in 
a motor skills context using tasks in which the 
subject tries to anticipate the arrival of some 
event in time and is required to make some 
motor response such that the response occurs 
with a predefined relationship to the external 
event. An example is hitting a ball, in which 
the subject anticipates the arrival of the ball 
and times his response so that the ball is hit 
when it is in a certain location. In the con- 
text of making responses generally coincident 
with some external event, Poulton (1952 
1957) has defined two classes of anticipatio: 4 
Receptor anticipation is involved when the 
subject can preview the arrival of the he 
coming stimuli to which the response i ius 
made. The word “receptor” impli 18 to be 
de. nc receptor" implies that t] 
subject is using his various extero- he 
(e.g., eyes, ears) to preview the e receptors 
class of timing is of very great vents. This 
motor skills since so many tasks ( M apes to 
a pitched ball) require preyi £., hitting 
quire preview and a timed 
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response. Perceptual anticipation, on the other 
hand, is concerned with situations in which 
the subject cannot directly preview the up- 
coming stimulus events, but in which the 


Tn of regularity 
Thus the subject 
pation, and, once 


; Since it is 
proprioception is a rich 
and it is thought to have 
control of motor responses, 
68) has reviewed the liter- 
ature and found little Support for the idea 
that proprioception has anything to do with 
learning, and Laszlo (1968) found that pro- 
prioception was not necessary for the per- 
formance of certain motor responses, this does 
not preclude proprioception from being im- 
Portant in the timing of motor responses. The 
Possible relationship between the response- 


produced Proprioceptive feedback and the 
timing of that 


the topic of this study. 


Source of stimuli, 
importance for the 
While Adams (19 


* Ellis, M. J. Possible me 
clock. Unpublished 
Urbana, 1968. 


chanisms for the human 
manuscript, University of Tllinois, 


Propriocertion DEFINED en 
propria. 

[thous 
them: 


[11 

Historically, “kinesthesis” and à 

tion" have had similar wae Oe, 
technically there is a difference be term ki 

According to Adams (1968), ue s to is 

thesis was first used by Bastian ( di 


«able to 
4 licab PE 
down. Also, kinesthesis is not app ariol! 


5 the 
the perception of the position of only 


yeme: e 
ception to mean the sense of moy with E 
Cluded under kinesthesis, mx po d 
forces, pressures, tensions, rela dy's ori p 
of the body segments, and the n this 15, ; 
tion and movement in space, cinesthesis 5j 
Way the term is used here. ES) poma 
more limited term, and Adams t to: E Pee 
out that it would seem desirab e. prop! m 
this way. The primary receptors nich lie 
ception are the proprioceptors, W an ay 
the muscles, tendons, and Tolgiso the por 
mine movements and positions. me à A 
segments. Sherrington also P roprioc® "m 
labyrinthian mechanism as a DP q with int 
This study is primarily Pet to tim 
Prioceptive feedback as it rela on the ^, 
behavior, with specific emphasis ceptor (e8 
of the proprioceptors and exteroc 
eyes, ears). 


MING "m. 
PROPRIOCEPTION AND TH 0 


y rop” 
Probably the first indication mi V5 
ception was related to Hen (1 Jeg? 
Goldstone, Boardman, and pit col E. 
They had subjects (young chil ie et 
students, and older adults) ‘eu m e. 
onds under varying conditione. optics 
aloud, which resulted in greater pu. a 
tive feedback from muscles of the 3 e ‘i 
larynx, produced greater ogee i 
Mating silently. Also, when y them “it 
asked to remain silent, a mover e 
lated instructions prohibiting ot a am or 
(e.g., tapping of the feet, ari preis 
ated movements of their own, W The auth og | 
ably aided the estimation D of coui 
concluded that the timing tas 
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Y : 
c grae upon proprioceptive feedback 
tues must on the proprioceptive | feedback 
Vals in so € associated with the time inter- 
that the E prior learning situation in order 
e ES. i e used effectively. Children below 
ao acs: 8 years apparently did not have 
tated Menus at hand, as they demon- 
fre are arkedly inferior estimations. Also, 
Seem to Musee of common examples that 
telated icate that proprioceptive feedback 
Who taps Ta timing, such as the musician 
Seneratin is foot or nods his head (thereby 
time to E proprioceptive feedback) to keep 
Xperin e music. Such observations, plus the 
a dental work of Goldstone, Boardman, 
Oman (1958), have led to a number of 
and hm of how proprioceptive feedback 
ese is PE might be related, and each of 
iscussed in turn, giving the various 


Ypoth : 
dence, eses, predictions, and supportive evi- 


St 


In 
Ee Hypothesis 


The 


Com input view is the most obvious and 


topra ense explanation for the linking of 
aintai eptive feedback and timing. This view 
a Serie ns that if a subject is required to make 
Drioce S of timed responses, he uses the pro- 
tion Dive feedback from some earlier por- 
of „Of the sequence to trigger a later portion 
beans response. The term “input” is used 
Pattern the subject is viewed as using the 
feedh n of input or inflow of proprioceptive 
E s from the earlier portions of the re- 
Wa, ©, t cue the later portion. Stated this 
to sti the input view is really closely related 
hypo lus-response theory and the “chaining 
that pee The chaining hypothesis. states 
stimulis, proprioceptive feedback contains the 
While 1 to which the responses become learned, 
aint, the input view goes a bit further and 
M Hur that the proprioceptive feedback 
beco, ins the stimuli to which the response 
hop 8 timed; thus, one has to do with 
Curs Occurs and the other with when it oc- 
ily (Adams, 1968). An example that might 
[) Strate the use GE such a mechanism is in- 
ee in rhythmic tasks, such as pedaling a 
Eu In this ae 2e muscular forces for 
Ension of the ERhCe must be applied at a 
after the pedal has passed the top 


Sint soon 
E and thus require that the sub- 


the circle; 
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ject “time” the application of force. Eliciting 
the “command” to extend the knee, neural 
transmission, and tension development in the 
muscle take a finite and, under certain con- 
ditions (e.g., Santa Maria, 1970; Schmidt & 
Stull, 1970), substantial amount of time. 
These delays should not be confused with 
classical reaction time, since reaction time is 
the delay in response to stimuli that are not 
psychologically predictable and are suddenly 
presented, which is not the case for the pres- 
ent example, since pedaling involves anticipa- 
tion. Thus the subject cannot use the pro- 
prioceptive feedback associated with the leg 
being in the desired position (i.e., just after 
the top of the circle) as a cue for knee ex- 
tension, since the delays in the system would 
cause the response (contraction of the knee 
extensors) to occur too late. Thus, it is pos- 
sible that the subject uses the proprioceptive 
feedback from an earlier portion of the re- 
sponse (e.g., related to some position on the 
upswing) to cue. the knee extension and that 
through learning, the subject “chooses” pro- 
prioceptive feedback cues that will account 
for the delays inherent in the system and 
insure that the response occurs at the correct 
time. Thus, the input view says that the 
subject uses the response-produced proprio- 
ceptive feedback from earlier portions of a 
response series as cues for initiating later 
responses and that the chief advantage of 
doing so is the elimination of the drastic 
delays due to the lags inherent in the motor 
system. Thus, it is possible that the subject 
has some mechanism that receives the pro- 
prioceptive feedback input and cues the de- 
sired response. Exactly how this propriocep- 
tive feedback mechanism might operate is 
certainly open to question, but one might 
envision the proprioceptive feedback input as 
having some time-predictable pattern (as a 
function of the time-predictable pattern of 
the movements producing the proprioceptive 
feedback) and that when some stage in this 
pattern Was reached, the mechanism would 
trigger the desired response automatically 
Learning to anticipate, under this view would 
involve the strengthening of bonds betwe 
the characteristics of the propriocepti en 
s i ptive feed. 
back input from earlier movements and 
later timed responses in the end he 
knowledge of results. Thus, the subject [5 
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Seen as adjusting the cuing mechanism by 
selecting as a cue that portion of the pro- 
prioceptive feedback input that results in the 
most accurate timing of the automatically 
cued response, 

À number of testable predictions can be 
made from this hypothesis. First, since the 
mechanism is seen as responding to changes 
in proprioceptive feedback, with Some mini- 


A second Prediction is that th 
tive feedback must be consistent from tria] 
to trial in order that the subject learn the 
Pattern of proprioceptive feedback input and 


e propriocep- 


If the proprioceptive feedback 
able from 


proprioceptive 
leads to the Prediction 
in an earlier 


of the later response. 
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Evidence for the Input H ypothesis 


s in- 
There are a number of studies M b 
directly link increases in propriocep enerally 
back to improved timing behavior, ventioned 
Supporting an input view. As Pe that 
earlier, Goldstone et al. (1958) menétaium 
counting aloud (presumably g the lips; 
greater proprioceptive feedback eem accu- 
tongue, and larynx) resulted in eae oe to 
racy that counting silently in coun second. 
seconds at the rate of one count per manua 
Adams and Creamer (1962b) used vta ha 
Step-tracking task in which the su ith no 
to keep a pointer between two d 2 of 
preview. At regular intervals of eit disappea 
Seconds, the lines would suddenly f the dis- 
and reappear on the opposite side 0 nticipate 
play, and the subjects’ task was to a position 
the change and to move to the Peel pro 
as quickly as possible, They ren spring 
Prioceptive feedback by varying spring- 
tension on the control (the control pe^ a 
loaded to return to the center post ing 
tween the two targets) and by van e te, 
amplitude of movement between nticip4” 
target locations. Using “beneficial ae ich 
tions” (the proportion of responses p new 
the subject began his movement to termin?" 
target within 133 milliseconds of the criterio? 
tion of the present target) as por im 
Score, they found that spring ems 
proved timing accuracy, but vnb ether 
failed significance, Tt is questionable air te 
the amplitude manipulation was ven am 
for the input view, since the input subj 
plitude was manipulated while m M 
was moving to the new position, Tum unt" 
arriving, the subject remained pcr i a 
the next stimulus change. Thus, pr subje 
accuracy was measured (the time m no pen 
began his next movement), there W noveno, 
Prioceptive feedback from arm m e oa 
With which the subject could cue d E 
response, However, the spring avant P 
Were large and Supported the gn E 
increasing the amplitude between pata y, 
tions on the Adams and Creamer en E 
also should have, according to Hoo bold £ 4 
increased the tension required m load", 
cursor On target (since the spring t 


"rude 
tended to center the cursor on mp ws 
gets), and one wonders why ut as 
effects were not significant for this 


| 
| 
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Possibly, increased amplitude created a me- 
chanically unfavorable system for human re- 
D onding that worked against the propriocep- 
tive feedback effects. Additionally, minimal 
Orce cues might be sufficient to provide a 
asis for timing, and increasing force beyond 
Some minima] value might produce no addi- 
tional effect, 
. Grose (1964, 1967) had subjects either 
lit a finger from a key, move the hand 2 
cet and press a key, or walk 9 feet and kick 
parrier at the moment a moving pointer 
ith Preview—receptor anticipation) became 
8ned with a stationary one. He found that 
Crea * Size of the motor response was in- 
a PM there was a progressive decrease in 
ip rale. (directional) error, and a decrease 
Crea, € within-subject variability (ie. an in- 
thi Je in Consistency). One possibility is that 
Hoc responses generated eel ann 
Ab Ve feedback, which gave the su jec 
of, <r basis upon which to time the arrival 
Conch, Moving pointer. This is not the only 
the "Sion that can be drawn, however, as 
No Procedures confounded movement time, 
fee ment complexity, and proprioceptive 
**, and additional work is needed to 
Waq te these effects. Ellis, Schmidt, and 
Variap (1968) attempted to control for these 
lit. 5$ and also to determine the role of 
The. Sources of proprioceptive feedback. 
to Stine’ a task in which the subjects had 
à tra” 2 seconds by moving a slide on 
ian Way. The interval to be epe 
he au en. the subject began to move, ang 
ect attempted to arrive at the end o 


J 
tra "n le 
ae kway so that the entire response took 


itag Seconds. They manipulated the am- 
Ading or 65.0 centimeters) and the 
WAN essentially 0 or 8.5 pounds) of the 
eas qt ina 2 X 2 design. They found that 
NM Amplitude of movement resulted in 
NUM solute error later in practice, sup- 
je © idea that the increased € 
i ide ack from the increased movemen 
` timing € Subjects with an improved basis 
ipltuge” but it should be pointed out a 
in this and velocity cues were confounde 
SR y ign, Load effects were id E 
MS eating findings nearly opposite 
Hit Ose g 


"lg bogs’ E. Timing control in finger, € Ts 
Ah, Unj 9Vements, Unpublished doctora T 
Wersity of California, Berkeley, 1964- 
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those of Adams and Creamer (1962b). In 
another study to determine the effects of 
various modalities of feedback, Ellis (1969) 
used a similar task but manipulated the ac- 
celeration cues (by manipulating the inertia 
of the lever system), velocity cues (by manip- 
ulating the viscous resistance by an electro- 
magnetic brake), and tension cues (by manip- 
ulating the spring tension on the movement) 
and compared these conditions with no- 
resistance control conditions, His results gen- 
erally supported the conclusion that the nature 
of the resistance (and, hence, the propriocep- 
tive feedback) to the movement caused differ- 
ential effects on timing accuracy with greater 
proprioceptive feedback resulting in greater 
accuracy, greater response consistency, and 
fewer trials to learn to a criterion. Further, 
Ellis had subjects spell two-, three-, and four- 
letter words during their movements, and he 
found that timing accuracy increased with 
increasing word length, the conclusion being 
that increased word length produced greater 
proprioceptive feedback. Finally, when the 
spelling was withdrawn, the no-movement 
control groups regressed drastically in timing, 
while the movement conditions showed only 
slight decreases. Apparently, the propriocep- 
tive feedback from spelling gave the subjects 
some cues for timing, and withdrawing spell- 
ing left them with no basis for timing, but 
the groups with movement could still rely 
upon the movement cues for timing in the 
absence of the spelling cues. It seems probable 
from Ellis’ work that subjects do not rely 
upon such gross mechanisms as counting in 
everyday tasks requiring timing, and that 
proprioceptive feedback is used in some way 
t 7 uracy. 
ug cow nudas seem. to implicate 
proprioceptive feedback in timing somehow, 
is a methodological problem that pre- 
a ski a strong conclusion. In each case, 
ere rioceptive feedback was manipulated 
tne eese the dynamics of the control, and 
by e ‘changing the proprioceptive feedback 
nig e also created a mechanically more 
may eum system for responding. Thus, a 
vm test of the role of proprioceptive feed- 
hen in timing would involve manipulations 
of. proprioceptive feedback in one -— 
re the response effected in another imb. 
one and Christina (1969) recognized this 


x74 
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limitation and sought to manipulate proprio- 
ceptive feedback in the left arm by having 
the subjects execute a minimal finger move- 
ment or a small or large rotary movement. 
Beginning the left-arm movement also started 
a target, and the subject responded with the 
right hand when the moving target became 
aligned with a stationary one. No preview of 
the target was given, and essentially the sub- 
Jects task was to estimate when the target 
would arrive at the stationary pointer, the 
travel time being constant at 1.50 seconds. 
Using "beneficial anticipations" (the per- 
centage of responses that were within 133 
milliseconds of correct) as the criterion 
Score, they found that the small rotary 
movement of the left arm led to significantly 
greater timing accuracy than the control 
(finger response) condition, with the large 
rotary movement falling between these two 
extremes. The findings supported the view 
that increased proprioceptive feedback in the 
left arm contributed to increased timing accu- 
racy in the right arm, and thus the increases 
in timing found in earlier studies that manipu- 
lated proprioceptive feedback in the effector 
were probably not entirely attributable to 
purely mechanical effects of control dynamics 
manipulation. On the other hand, the findings 
Were not completely clear, since, according to 
the predictions from the input view, the group 
with the greatest proprioceptive feedback 
should have been most accurate, which was 
not the case. Further analysis of the data 
revealed that the large 


left-arm movement 
was far more inconsistent (within subjects) 


than the smaller left-arm movement (3.5 
times as inconsistent), and that it was pos- 
sible that the inconsistency of left-arm move- 
ment in the larger response provided a poorer 
basis for timing than the smaller, more con- 
sistent movement, This provided indirect 
Support for the input hypothesis, since con- 
sistency of the proprioceptive feedback input 
from trial to trial was hypothesized to be one 
of the relevant variables. 

One simple strategy which the subjects 
could be utilizing the proprioceptive feedback 
for timing would be to respond with the right 
hand when the left arm had reached a certain 
position. Schmidt anq Christina used within- 
subject correlations (Grose, see Footnote 3; 
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re 

Schmidt, 1969) to measure the degree Of 
lationship between the amount of left arm 
arm movement (the position of the js E 
when the subject responded et. respect | 
hand) and the algebraic error (wit x han 
to sign) in responding with the e subject 
over a series of trials. Thus, bae ted for 
had a correlation coefficient calcu oefficien 
him, the hypothetical W for the p over 
being the number of trials (i.e. Finding? 
which the coefficient was calculated. subjects 
indicated that exactly half of the “ait an 
tended to covary the left-arm ample hand; 
the time of responding with the r!É 40 
showing correlations greater op 
needed for significance at the .01 

Christina (1969, 1970) used 
right-arm task as did Schmidt an 
(1969), but investigated the effects Fd 
and movement proprioceptive Si S-inch) | 
had subjects make either a smal aon ent 
hand movement, a 13-inch linear ear m a 
of an unloaded slide, or a 13-inch ef 
ment of a loaded slide, each wit 
arm. The apparatus was like that 0 
and Christina’s (1969) in that the | 
initial movement of the slide ra hand n 
get, and the subject moved the le the m% 
one direction until he responded E that th 
task with the right hand. He four : 
groups with left-arm movement Lie pe 
the right-arm timing task more C? in t i 
than the no-movement control group, M 
load effects not being significant. etit 
there were no significant differenc^ aic of 
treatments using absolute eror, Sa 
ror, beneficial anticipations, O" 
trials to a criterion. The finding ia y E 
ficial anticipation effects was A pris? 
earlier findings by Schmidt an¢ 
1969). i 
In m earlier evidence adba% ny 
that greater proprioceptive amics wr 
manipulating the control dy” pese f° ivt 
effector improved timing, but t 1 
confounded mechanical and 


the sam 
d Christin’ 

nsio 
of tension 


pen 


ceP ine 
proprio wil! 


effects. Thus, the effects of Pe ing , 
drawal by Ellis (1969), and t Poi 


uU 
; s 2 "gnP 
* Christina, R, W, Proprioception Soses. onan 
the anticipatory timing of motor E: o * 
lished doctoral dissertation, Universi- 
1969. 
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Schmidt and Christina (1969) and Christina 


(1970; see Footnote 4) with left-arm pro- 


Prioceptive feedback manipulations and a 
tight-arm task, are the only sources of evi- 
dence that clearly link proprioceptive feed- 


ack to timing performance. 


Decay H ypothesis 


The proprioceptive feedback decay hypoth- 


&sis was first stated by Adams and Xhignesse 


(1960) to account for the timing of responses 


When there was no predictably changing 
perce of proprioceptive feedback input (e.g., 
vith perceptual anticipation). Essentially the 


Same ideas were presented again by Adams 


and Creamer (1962a, 1962b) and were for- 


mally tested by Adams and Creamer (1962b). 
he hypothesis states that if the subject 
makes a response at time £, the proprioceptive 
Sedback from the response enters a short- 
erm storage system (possibly short-term 
Emory) and decays predictably as a func- 
pe OF time. The pattern of decay is not 
nied, but the assumption is that a given 
Ovement will produce a predictable input of 


Proprioceptive activity, and this activity will 


decline predictably as a function of time in 
E ium memory. Now, if an additional re- 
ae is to be made at time £+ Af (At a 
constant), the decay view states that the sub- 
Jéct uses the time-varying properties of the 
a Prioceptive trace in short-term memory as 
time standard, possibly responding when the 
"Oprioceptive feedback activity had decayed 
M given level, thereby providing the sub- 
re, vith a basis for cuing the later timed 
Sponse, 
con Common example in which the proprio- 
Ptive feedback decay hypothesis might ap- 
Y is springboard diving. The proprioceptive 
Sedback cues associated with acceleration off 
d board would enter short-term memory 
t begin to decay, and the diver might use 
decaying proprioceptive feedback cues as 
ps upon which to make some later Ie 
whi? such as a precisely timed “untucking’ 
is th he is in the air. The decay hypothesis 
p distinct from the input view stated 
Her, since the decay view states that the 
T uses as a cue the decay of proprioceptive 
accel ack. from an earlier movement (e85 
eration from the board), while the input 
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view states that the diver uses proprioceptive 
feedback input from some concurrent move- 
ment (e.g., the proprioceptive feedback cues 
from rotation while he is in the air). 

There are a number of testable predictions 
that come from the basic hypothesis. As with 
the input mechanism discussed earlier, the 
decay mechanism is seen as a change de- 
tector with some minimum change required 
to activate it. Thus, the greater rate of 
change in proprioceptive feedback decay, the 
smaller will be the lag in activating the 
mechanism, and a response triggered by the 
mechanism should be executed more accu- 
rately also. One way to increase the rate of 
proprioceptive feedback decay, provided that 
the pattern of decay is exponential or at least 
not linear (e.g. a decreasing negatively ac- 
celerated function of time), would be to in- 
crease the absolute amount of proprioceptive 
feedback activity placed in short-term mem- 
ory, which should result in a greater decay of 
proprioceptive feedback in the first few sec- 
onds. This leads to the prediction that if the 
preliminary movement at time £ has large 
force or amplitude requirements so that it 
generates considerable proprioceptive feed- 
back, then there should be greater propriocep- 
tive feedback in short-term memory, a greater 
rate of decay in the next few seconds, and 
less lag in the mechanism for triggering the 
subsequent response, with the result that the 
timing of the response at time ¢ + A£ will be 
more accurate. 

Second, if the pattern of decay is decreas- 
ing and negatively accelerated, the rate of de- 
cay will be decreasing with each successive 
moment in time. If the interval Aż is short, 
the mechanism will be triggered when the rate 
of decay is great, and when the interval A£ is 
long, the mechanism will be triggered when 
the rate of decay is relatively smaller. Thus, 
since the lag in the mechanism is dependent 
upon the rate of decay, the mechanism should 
be less effective at long intervals and more ef- 
fective with very short intervals. This leads to 
the prediction that the decay mechanism 
should be very effective, relative to a no- 
proprioceptive-feedback ^ control condition, 
when the interval is short, but should be no 
more effective than a no-proprioceptive-feed- 
back control condition when the interval is 
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very long. In other words, there should be an 
interaction. between presence or absence of 
proprioceptive feedback írom a prior move- 
ment and the length of the interval (At) to 
be estimated. ; 
Third, if the preliminary movement were in- 
consistent from trial to trial as the subject 
was learning the task, he would have a differ- 
ent amount of proprioceptive feedback intro- 
duced into short-term memory on each trial, 
and the activity would decay to the cuing 
level in a different number of milliseconds for 
each trial. If this be the case, the subject 
would have no basis for pairing the decay of 
feedback with the time interval to be esti- 
mated (the time from ¢ to / + At), and per- 
formance would not be better than if some 
"cognitive" mechanism were used. Thus, the 
timing of the second Tesponse should be a 
function of the consistency (of force, ampli- 
tude, direction, etc.) of the first response, 
with the greatest timing accuracy being pro- 


duced when the first response is as consistent 
as possible from trial to trial. 


Evidence for the Decay Hypothesis 
Adams and Cream 


the display, he 
; so that he was on target 
again. The subject co 


predict the chang 
(perceptual anticipation ) 
manipulated the force n 
lever on targi 
loaded to retu 
and the mover 
that the propr 
response ; in 


in advance 
. Adams and Creamer 
ecessary to hold the 
et (the control being spring- 
rn to the center of the display ) 
ment amplitude, They predicted 
'ioceptive feedback gener; 


and that 
from response 
ming accuracy 
They found that spring 
nificant increase in tim- 
hey used "beneficial an- 


greater proprioceptive feedback 
? would result in increased ti 
for response n Ps 
loading caused a sig 


ing accuracy when t 
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on 

ticipations” (the number. of ers he 
which the subject started his oven ha 
new position within 133 milliseconds Wn am- 
the lines terminated) as the Kerl nd non- 
plitude effects were extremely smal a redicts 
significant. The decay hypothesis Wes ac- 
that shorter intervals should be m which 
curately estimated than longer rs) finding 
Adams and Creamer found, but this ay oF 
does not differentiate between the incu 
purely “cognitive” (e.g., counting) m longer 
which both predict poorer timing WI 
intervals. hy- 

Although the Adams and Creamer am an 
pothesis has been generally eee timing 
empirically supported mechanism p some 
(e.g, Singer, 1968, p. 74), there dams a” 
methodological problems with the A ore cat- 
Creamer experiment that warrant a monstrate 
tious approach. First, in order to de ioceptivt 
that it was the decay of the propr: s. it i 
feedback which the subject was br dur- 
necessary to have the subject be at T nt unti 
ing the interval from the first boca cou 
the second movement; otherwise, On ce as 
not be sure that the better perius due 
function of the control dynamics an k dur* 
to the input of proprioceptive Per w 
ing the interval (the input hypothesis pad t° 
ever, Adams and Creamer's subjects jon 
hold the lever in place against werd 
during the interval to be timed, an „e due n 
be that the spring tension effects we eedba¥ 
the subjects using the proprioceptive in erval: 
input from static tension during the cts ere 
Thus, although the spring tension effe twee 
significant, they do not differentiate... n 
the Adams and Creamer decay hypo di t 
the input hypothesis. The amplitu as me. 
Sion of the Adams and Creamer ves sin 
gave a valid test of the decay hypothe val E: 
the limb was motionless during rper n 
be timed, but the amplitude effects ive b 
significant, Second, the propriocepu? ted jd 
back from spring loading was gen nse; "1 
the same limb that effected the repe syst 
one cannot be sure that the RM mo. 
did not simply create 4 mechanic is, & 
favorable system for responding. ype 
valid test of the Adams and Creamer ic d 
= à fi : in Wc 3p 
SIS would involve a situation ta 
subject was motionless (with no § 
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Sion) during the interval to be timed, and in 
Which the proprioceptive feedback was ma- 
Nipulated by manipulating control dynamics 
Ma different limb from the one which is the 
lector for the timed response. 
Christina (1970; see Footnote 4) recog- 
"ized the design deficiencies of the Adams 
and Creamer experiment and gave the decay 
Ypothesis an additional test. He used a dis- 
Crete task in which the subject had to make 
à finger response to estimate the end of a 
“second interval and manipulated proprio- 
p? live feedback in a prior movement by hav- 
ie no movement, unloaded movement, or 
rid movement, each performed prior to the 
ie erval to be timed. His subjects (in the 
Ovement conditions) made a linear 13-inch 
Movement of a slide on a trackway with the 
eft hand, When the subject reached the end 
of the trackway, a switch started the interval 
E. be timed, and the subject responded 2 
ue later with a right-hand response, the 
inn hand remaining motionless during the 
su erval, If the decay hypothesis is true, 
apg with prior movement would be su- 
ends to the control, and prior movement plus 
Would be superior to the movement alone. 
diets were negative, with no significant 
" erences shown for algebraic error, absolute 
D and “beneficial anticipations," which 
not support the decay view. 
Eug en the proprioceptive-feedback-gen- 
(19798 left-arm movements in the Christina 
Sister, experiment were not perfectly con- 
Seer nt from trial to trial, and since consistency 
fens necessary for the decay method to be 
To Clive, this may have prevented the decay 
m providing an effective basis for timing. 
Tistina did attempt to have his subjects 
oute the left-arm responses as consistently 
the pssible, providing preliminary practice at 
5 eft-arm task alone prior to actual testing 
righty hasize consistency, „and, during the 
te nd practice, providing knowledge of 
m ts to inform the subject whether he was 
;Ponding within the acceptable a priori 
NEN lor the left-hand movement. Even $0, 
Vari eft-hand movements showed considerable 
Que elity within subjects over trials. Thus, 
Ma Sada and Schmidt (1970) attempted to 
by © the left-arm responses more consistent 
Using a passive, motor-driven movement. 
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Subjects in a movement group had their left 
arm moved passively through 26 inches at a 
constant speed so that the movement time 
was nearly consistent at 1.42 seconds on every 
trial. At the end of the movement, the sub- 
ject's arm came to rest on the arm of a chair. 
Control subjects had complete view of the 
moving handle, but did not hold onto it, 
their arms resting on the arm of the chair 
during the handle movement. The end of the 
handle movement signaled the beginning of 
the 2-second interval to be timed (also sig- 
naled by a light and a buzzer), and the task 
was to lift the right index finger from the key 
when 2 seconds had elapsed. On Day 1, sub- 
jects had 40 trials with knowledge of results, 
and the movement subjects demonstrated im- 
proved timing accuracy with smaller within- 
subject variability and absolute and algebraic 
error, which supported the decay hypothesis. 
On Day 2, one week later, subjects were given 
10 additional trials with knowledge of results, 
followed by 20 trials on which knowledge of 
results was withheld. It has been found (Ellis 
et al., 1968) that knowledge of results with- 
drawal leads to a drift in the direction of over- 
estimating time, and support for the decay 
hypothesis would be provided if there were 
less drift for the movement group. This was 
found to be so, with considerably less absolute 
error and within-subject variability for the 
movement group near the end of the 20-trial 
knowledge-of-results withdrawal period. Thus, 
these two findings provided two lines of sup- 
port for the decay hypothesis, and indicated 
that Christina. (1970) may have failed to 
provide support in his experiment because of 
the inconsistency of the left-arm movements. 

The Quesada and Schmidt study was the 
only one found that supported the decay hy- 
pothesis, and so has provided a basis from 
which to investigate further the generality of 
the decay view. Research is currently under 
way to investigate some of the other predic- 
tions of the hypothesis, such as the effects of 
left-arm consistency and of the length of the 
interval (¢ to £ + At) to be estimated, to de- 
termine whether this hypothesis is appropriate 
for other classes of proprioceptive feedback 
(e.g., force, vestibular), and to determine 
whether the hypothesis is supported for tasks 
using receptor anticipation. It is certainly an 


CP 


interesting hypothesis, one which could goa 
long way to explain the nature of the cuing 
process for the timing of motor responses, 
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Preprogramming Hypothesis 


Of course, it is possible that either or both 
of the proprioceptive feedback hypotheses are 
wrong, and that the subject responds on the 
basis of a neuromotor program without rely- 
ing upon proprioceptive feedback. Under this 
view, the subject has a program that dictates 
the timing of the various submovements in a 
sequence, and once the program is triggered, 
the timing of the Sequence is smoothly and 
precisely executed, This idea has been pre- 
sented by Fitts (1964), who referred toa 
hierarchical system of motor control implying 
that there are higher centers to select the 
Correct program and lower centers that con- 


tain the Program and execute it upon com- 
mand from the higher ce i 


ontrol of motor responses, and for 


studies by Adams 
but one study in 


arget beg & 


t would change his timi S 
that the left-hand but : uc 
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he 
would continue to press yu kr 
fast rate. This evidence indicates Aon feed- 
timing of the responses was not a ib mata 
back control, since subjects were able second, 
only two to three corrections pet ibjects 
Which was much slower than Pew S t ae res 
and might indicate that the timing "Howeveld 
sponses was truly preprogrammed. a subjects 
since Henry (1953) has shown wa ck with- 
can respond to proprioceptive x p it i5 
out perceiving it (also Posner, 19 cts were 
certainly possible that Pew's "ug dback 
responding with a proprioceptive i ng suf- 
basis, with the closed-loop contro id rate 0 
ficiently fast to account for the D UE view 
button pressing. Also, a preprogram thout any 
that predicts that subjects time w1 k wou 
reliance on proprioceptive fondue Ellis e 
have trouble accepting the findings e d Chris 
al. (1968), Ellis (1969), Schmidt a a 
tina (1969), Christina (1970), an 
and Schmidt (1970), who all EY ute! 
hanced proprioceptiye feedback ‘tainly pos 
to improved anticipation. It is cer rogral 
sible that timing jg governed by nm the 
only in certain Situations, and determ ramme 
laws governing when timing is peser con" 
and when it is proprioceptive fee 
trolled will be a major research task. 


DiscussioN 


5 ove”. act 
flow from earlier movements in p subje 
sequence as a basis for timing, t pa 
executing the timed response — that ng 
tern of input has reached some a proble id 
been learned. After methodologica was fo", 
Were eliminated, empirical support nypot ^. 
for the Adams and Creamer decay y of P as 
in which the subject used the decr emoty e 
Prioceptive feedback in owe e te ge 
a basis for timing, but it shoul der t° ipe 
further in a number of ways In d was 
termine its generality. Finally, the 
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Possibility that proprioceptive feedback is not 
Involved in timing at all, with timing of the 
Various movements in a sequence being gov- 
erned by motor programs that spell out the 
Precise temporal relationships among the re- 
Sponses in the sequence. Of course, it is pos- 
Sible that all three mechanisms are available 
to subjects, and that different mechanisms are 
pea for tasks with different display and re- 
Ee requirements. Determining if, or under 
vhat conditions, each of these theoretical 
n "'opositions is appropriate for the explanation 
maming in motor behavior will be a major 

ertaking, but a thorough understanding of 
Motor behavior will await the answers. 
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alues during intervals between successive 
practical problem in real life and a 


hough there has been a tendency for 
ye time error and the latter, negative 


ntly very important. Discrepant results 
fferences in stimuli and psychophysical 


lysis, peripheral aftereffects, adaptation, 
he results of a “model” 
€ errors are seriously 


study are cited 
affected by the inclusion of poor 
color memory are important. 
A a 

memory by optical physicists indicated g 
large apparent intensification of color a an 
memory, especially for the saturatior 
brightness aspects. 19305; 

2. Time error research. During the y psy” 
considerable effort was expended by occu 
chologists to determine the errors gr an 
during successive matching of stimuli We 
Sensory modality: the research was kno ap” 
the study of the “time error.” Errors emory 
parent stimulus intensification in he ap” 
Were known as positive time errores, e time 
parent fading in memory was a negar ate 
error. The time error studies clearly 5 ime" 
à negative time error for the intensive E 
sions of most stimuli, including ge 
tion and brightness aspects of color. its # 
error and color memory research rest 
thus clearly in conflict. ETSI (a) 

The purpose of the present study directi" 
review the literature relevant to the dur 
and magnitude of errors that Ee 
successive judgments: (b) discuss fac 
derlying the conflicting results; and 


ri y 
marize and evaluate the adequac) 
studies. 


é I 
Cotor Mrmory RESEARCE 


can 
The study of memory for color 


traced to Hering (Boring, 1942), 


jor to 
Some research was performed prior 
(Bentley, 
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the colors of unfamiliar objects. Following 
atz, color memory research waned until 
1942, when Hanawalt and Post performed an 
tensive four-part study to investigate satura- 
tion memory. 
, During the first part of their study, sub- 
Jects were shown color chips, which they at- 
tempted to match from nine very similar al- 
<a The experimental conditions were 
er a delay of 1 second or 1 week between 
€ presentation of the standard stimulus and 
tio, comparison stimulus. For the first condi- 
S n they found a positive time error, but no 
Ystematic error was found for the l-week 
terval, 
of os the second part of the study, a method 
) Imits, using a color wheel, was employed. 
[entire time errors were found on “ascend- 
p. Series (an increasing proportion of the 
‘tching color) and positive time errors for 
j descending series. An average of ascending 
no Acending series gives a nearly zero time 
qdttration memory was the subject of the 
Bst and fourth parts of the Hanawalt and 
Show, Study. In the third part, subjects were 
jon n standards of either 5076 or 75% satura- 
Com and were asked to indicate whether a 
va Parison stimulus identical to the standard 
T S more or Jess saturated than the standard. 
nd did not find time errors for either stan- 
I at either the 28- or 58-second interval. 
? the fourth study part, standards of 20%, 
4g is and 95% saturation were used. After 
ig , 9Urs, the subjects were asked to determine 
e, 22 identical comparison stimulus was of 
"àl, more. or less saturation than the stan- 
2 "d. They found negative time errors for the 
752° and 95% levels, but no error for the 
© Standard. 
amui and Landis (1955) had subjects 
of a each of 10 Ostwald chips to a set 
int 2 alternatives after memory spans of 15 
m, les, 24 hours, and 65 hours. The authors 
E S conclusions for hue, but they found 
liy, lve time errors for brightness and posi- 
Wa, „time errors for saturation. The effect 
Independent of memory time. 
Of E Intensive and carefully controlled study 
Bire or memory was undertaken by Newhall, 
br, ham, and Clark (1957), who were per- 
ed by the lack of knowledge and agree- 
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ment on errors in color matching and the 
great need for such knowledge: “whenever a 
color inspector or a technical color tester com- 
pares a color sample in one location to a color 
standard in another location, color memory is 
necessarily involved [Burnham & Clark, 1955, 
p. 164]." Three subjects were shown 25 
standards on a colorimeter screen. The 25 
standards were samples of different hues, 
brightnesses, and saturations. After a 5-second 
memory interval with a blank screen, each 
subject readjusted the colorimeter until a 
match to the standard had been achieved. 
Match times averaged 25 seconds after the 
memory interval ended. The authors found 
no constant error for hue; however, they ob- 
tained a large positive time error for satura- 
tion (1.7 Munsell chroma points) and bright- 
ness (.18 Munsell value points). Moreover, 
they found the size of the time error was 
highly correlated with the level of the stan- 
dard. The authors advanced the hypothesis 
that the saturation and brightness of a color 
is intensified in memory in proportion to the 
absolute intensity level of the stimulus. 

Three supplementary studies are also re- 
ported by Newhall et al. (1957) as providing 
relevant data on this topic: (a) A study by 
Burnham is essentially a replication of the 
preceding experiment; the results are almost 
identical, though match times average about 
50 rather than 25 seconds. (5) Burnham and 
Clark had subjects match Munsell chips from 
nine similar alternatives after a 10-second 
memory interval. They found, in agreement 
with the two previous studies, a positive time 
error for saturation that increases with more 
intense standards. However, they found no 
time error for brightness. (c) Newhall and 
Pugh asked subjects to select Munsell chips 
that matched a variety of absent familiar 
stimuli for which the average color values 
were known. They found hues shifted toward 
verbal norms; in addition, they found positive 
time errors for saturation and brightness. 

Bartleson (1960) ran an experiment simi- 
lar to that of Newhall and Pugh. He too 
found that hue shifted toward verbal norms. 
Positive time errors were obtained for satura- 
tion and brightness—for instance, red bricks 
got redder, grass greener. 
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Fic. 1. Kóhler's b-function for loudness of click 
stimuli. 
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Summary of Color Memory Research 


The early research b 


y Hanawalt and Post 
(1942) and Hamui 


and Landis (1955) was 
findings of positive, 
ime errors for satura- 
le data were obtained 
rch initiated by New- 


TIME Error RESEARCH 

In general, 
test memo; 
Christal’s (195 


functions are found 
Sons for all stimuli 


tha Should be expected 
that. similar error functions wil] exist in color 


Fechner’s m emory Image Theor y 


theory 
arized by Needham 
that it was Fechner 
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er- 

mulated an image theory for the conste ex- | 
rors, which he called me pee cording 
planation can be traced to Aristotle. ae 0 
to this concept, a subject obtains an i sulting 
the standard that fades with time, E. com- | 
in an apparent intensification of eod 
parison stimulus—a negative time source 9 
order to remove the time error as 1 methods: 
Systematic error in psychophysica balancin 
Fechner introduced the counter? uli at 
method in which the comparison rea stim 
presented before and after the standa 
lus an equal number of times. t that 1€ 

Needham (1934a) pointed og ro chnigu? 
searchers not only found Fechner’s limina 
inconvenient but it did not entirely € ie 
time errors, Moreover, the a ta 
inhibited research on time error Ie that | 
1920s (Pratt, 1933a), with the a n 
numerous critiques of the memory o ) cite 
cept were aroused. Postman e weigh in 
Studies with tones, brightness, anc mparis 
which time errors with imageless co 
were found. 


Kühler's Fading Trace Formulation ofa 


igation 

Wolfgang Köhler began an mwenn re in 
the time error problem in 1923, nO source s 
Psychophysical techniques, but as ie senso", 
valuable information about pcs In 0 
mnemonic traces (Marchetti, 194 wit m 
Study, Köhler tested 12 subjects that of 
stimuli of varying loudness. He pr ea | 
memory intervals to 3 seconds, the fre 
function was positive. At 3 seconds, the hi 
tion became zero, and beyond Dre T 
error grew progressively e meee 
function is shown in Figure 1. ing m€" of 

Kohler called this function ery sey 
time and time error a pala n He ise! 
interval (Needham, 1934a, irn e 0 "of 
pothesized that a lag exists betwee 
of the stimulus and the rune e " 
the H ions in the brain; the action F 
the positive portion of the ptun the ( 
man & Sharp, 1941). pei jons 
buildup, a continuous dispersion of 2 
similation) ensues. “This Hi is, qn 
was à non-process condition, t eed ye 

im! gh 


eU | 


" " te 
not have a conscious cor d E 
1934a, p. 236]." Kühler's d odii 
numerous studies and received 
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Visual research. Bentley's (1899) early 
Work with brightness revealed no time errors 
for 2- and 6-second memory spans. Increas- 
ingly more negative errors were obtained for 
Intervals over 10 seconds (tested to 5 min- 
utes), Koester (1945) indicated that Fodor 
and Happisch also obtained a negative time 
error for brightness with a 30-second interval. 

Peak (1939) presented standard and com- 
Parison stimuli only 177 and 285 milliseconds 
apart. In both cases she found positive time 
“rors for brightness, as predicted by Köhler; 

€ 285-millisecond interval had a slightly 
arger error, 
3 Kreezer (1938) successively presented light 
Eus to the same retinal spot or to opposite 

eS of the retina. Light to opposite sides 
Produced only a negative time error, while 
‘ght to the same spot yielded a decreasing 
positive curve, He theorized that the posi- 
Ke Portion of the p-function for brightness 

an artifact of a local “inhibitory afteref- 
fect.» 

An unpublished study by Crannel, reported 
: Tresselt (1944b), tested time errors in 
Do enis of the visual extent of successively 

jected dotted lines on a screen. He found 
Re the dots were remembered as being 
lc together than their true separation; he 

€rpreted this as a negative time error. The 
iy increased with longer memory times. In 
Nother study of extent, Allport (1931) moni- 
Se the successive reproductions of Binet 
th designs by 350 children. He found that 
Bs. reproductions grew smaller, and this was 

o interpreted as a negative time error. — 
(19 Inesthetic research. Payne and Davis 
du utilized a single subject to obtain 
Wei» On the successive comparisons of lifted 
m hts. The authors found a p-function iden- 
i^ to Kühler's; however, they interpreted 
e tesults in terms of fading muscle po- 

ùtials, 
Zeeman and Sharp (1941) obtained a p- 

nction similar to Köhler’s for intervals up to 
Wa Seconds; however, at 60 seconds the error 

5 less negative than at 30 seconds. Although 
ul agreed with Kóhler's theory, they also 
Sult that a major factor influencing such re- 
i. tS is “differential backlash excitation from 
© periphery.» 

einstein (1955) performed a study with 
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weights which is analogous to Kreezer’s same- 
or opposite-side retinal study. His subjects 
lifted weights either alternately with each 
hand or successively with one hand. The re- 
sults are also analogous to Kreezer’s because 
there is a more positive time error for the 
same hand (same area). This again indicates 
the possible importance of peripheral after- 
effects in determining the shape of the $- 
function. 

Auditory research. Needham (1934a) repli- 
cated Kóhler's experiment and confirmed the 
original findings, with the exception that the 
transition from positive to negative errors 
occurred at 2 rather than 3 seconds. 

Postman (1946) hypothesized that if time 
errors are the result of a fading trace, then 
one should predict that sounds varying in in- 
tensity will produce much larger time errors 
than sound varying in pitch, that the pitch 
of a faded sound will be identifiable, and 
that the original loudness level will be al- 
tered. The results support this hypothesis— 
there was a significant time error for loudness, 
but not for pitch. Tones varying in pitch were 
accurately matched even after intervals of 
20 seconds. 

Tresselt (1944b) reported that Danzfuss 
found memory for the aesthetic value of 
chords followed Kóhler's ?-function. 

Other research. According to Koester 
(1945), Fodor and Happisch tested time error 
during successive comparisons of salt taste. 
They found Kóhler's p-function for intervals 
to 40 seconds. 


Koffka’s Intensification Hypothesis 


In accordance with the principles of “point- 
ing" or “emphasis.” Koffka (1935) hypothe- 
sized that the intensity of stimuli (such as 
loudness, weight, or saturation) would in- 
tensify in memory, and to a greater degree 
with more intense stimuli, longer memory 
more familiar objects (Davis, 


times, and 


1952). 
Visual research. Warren and Shaw (1895) 


had subjects successively compare the size of 
150-millimeter squares with stimuli differing 
by a few millimeters. They obtained a large 
positive time error (apparent size increase 
in memory), which increased steadily with 
longer memory intervals (to 20 minutes). 
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Marchetti (1942) also tested time errors for 
the visual extent of lines and obtained a func- 
tion exactly inverse to Kohler’s; that is, it 
begins as negative and becomes increasingly 
more positive (line remembered as longer 
than actual) for intervals greater than 2 
Seconds. This result was confirmed by Mc. 
Clelland (1943). For the memory of color 
saturation, Collins (1931) found positive 
time errors. 

Kinesthetic research, Newhall (1932) stud- 
ied the judgment of pressure of cutaneous 
points stimulated successively and reported a 
Positive time error for intervals to 10 sec. 
onds. He cautioned, however, that his results 
were confounded by fatigue effects from the 
first stimulation Which would tend to produce 
à positive error. 


Auditory research, Needham (1934a) cited 
a study by Wada in 


Findings not Supporting @ Positive or Nega- 
tive Time Error Theory 
Visual research, 


Visu Katz (1935) predicted that 
stimuli would be e 


Xaggerated in memo 


fade, Katz 
is hypothesis, 

Successively 
sual stimuli such 
es, and triangles, 
nember different 
E as area or size 
Ositive or nega- 
ing upon the particular 
judged; no in- 
errors were ob. 
x nemory interval 
(3 minutes, 10 minutes, 7 days). 

Lesser (1959), in an experiment similar to 
Irwin and Seidentelg, had Subjects succes. 
Sively match broken and closed lines, ellipses 
Squares, and circles after l-, 3-, and 8-second 
memory intervals, He found (a) an increasing 


presented evidence to Support h 
Irwin and Seidenfeld (1937) 
presented different types of vi 


as dotted and solid lines, circ] 
Subjects were 


Bes in the n 
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ipse; 

negative time error for the pe pet. 
(b) a positive time error for t is to zero 
square at 1 and 3 seconds ie He other 
at 8 seconds; (c) no time errors for f inter- 
figures. Lesser argued for some type a sur- 
action between the standards and tion on 
round. We see in the following red cor- 
context effects that Lesser was proban d wa 
rect. An experiment similar to ee foun 
performed by Crumbaugh (1954), T disi P 
a positive time error for a gap in a follow 
widening of the gap) to 5 oy simila 
by an increasing negative tüngtton. i tilt in 4 
result was obtained for the angle 0 es grew 
line. Other results showed that unl d 
more circular (interpreted as a on we more 
error) and unequal circles grew ual wit 
equal (negative time error) then uneq 
longer memory intervals. . 

oer (1944b) also tested time bt 
visual extents. She found large nega o errors 
errors for broken lines or circles E author 
when these figures were solid. The are the 
argued that stimulus cipractenisia n. tha? 
determining factor for time errors ra ry trace: 
a fading or intensification of a ape circles 

Karlin (1953) presented black so an in 
to subjects on white cards. He nei d 
creasing negative p-function for the ised in ? 
Son of sizes, The entire curve was a resent 
positive direction with increases in l 
tion time (1, 3, 5 seconds) for the vestigate. 

Andrews and Robinson (1948) iny ri he) 
time errors in the Müller-Lyer illusio! e 


r 
tanda! ed 


ves involve” 
Successively presented the two agures ^ "funt. 
in the illusion and found a n an 
tion (decrease in the illusion), whic interv 
increasingly positive for memory à 
greater than 6 seconds. 914) foumi 
Auditory research. Vance (1 negligi” " 
that male subjects showed mostly n6f, e 0 


time errors, whereas females had ye mat 
negative time errors, which varied sY" th 
cally according to the frequency Qin 
Standard, but a 
Koester (1945), contrary to be net 
agreement with Postman (1946), fo alts 1579 
sible time errors for pitch. His open D 
cated that constant errors for ego E 
ments are somewhat larger n ing 
pitch but do not systematically re^ 
or direction to interval length. 
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Pratt (1933b) tested time errors in audi- 
tory intensities with an absolute identification 
Method in which subjects rated the standard. 

ime errors were determined by subtracting 
the average rating for the series from the 
Mean of the rating scale values. Contrary to 

atz’s theoretical predictions, Pratt found 
that loud stimuli produced negative time er- 
Tors, while positive errors were obtained with 
Soft stimuli. 

Other research. Woodrow (1933a) found a 
arge interaction between time error direction 
and subjects for a temporal spacing task. 

ubjects were evenly divided between syste- 

ie positive and negative p-functions; the 

ect of averaging these subjects was an over- 
all -function close to zero. 


Context Viewpoint 


Researchers holding a context viewpoint 
Nad found the simple trace explanations for 
ae errors such as those of Kéhler (1923), 
apika (1935), and Katz (1935), greatly at 
~ oS and guilty of gross oversimplifications. 
ie of the theories, for example, explained 
fron apparent fading or intensification effects 
b uce time errors no larger than about 2056 
un of the enormous errors that should 
int i tif a trace completely faded or greatly 
cor iied. Apparently, traces do not fade 
a pletely or intensify greatly, or there are 
(ditional mechanisms involved. 

. ‘he context view holds that a pair of stim- 
E is not presented in a vacuum; rather, the 
"n uation of the comparison stimulus is af- 
E by previous stimuli, such as the stan- 
the 4 the level of stimuli in the series, and by 
Scri absolute scale of stimulus meaning pre- 

ibed by experience [Osgood, 1953, p- 93]." 

‘esearch on context effects may be con- 
omlently divided into two categories: series 
iter and assimilation effects. Series effects 
Stin ve time errors that are the result of prior 
in lation or of the magnitude of the stimuli 

a € series, Tncluded here is what Osgood 

S labeled as the central tendency effect: 


q 

NUS in the impression of any standard during 

the -V (standard-comparison) interval tend toward 

Co at lage magnitude of stimuli previously en- 

Stange eds weak standards augmenting and strong 
ards fading [Osgood, 1953, p. 971. 


© may recall that Katz advanced a trace 
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hypothesis that predicts exactly the reverse. 
Assimilation effects include time errors at- 
tributable to the effect of background stimula- 
tion or stimuli interpolated between the stan- 
dard and comparison stimuli. 

Series effects—visual research. lpsen, ac- 
cording to Osgood (1953), reported a central 
tendency effect for visual extents, small stim- 
uli producing positive time errors while large 
stimuli produced negative errors. 

Series effect—kinesthetic research. Fern- 
berger (1931) found a large central tendency 
effect in an experiment employing an absolute 
judgment method with weights. If light 
weights were preceded by a heavy weight, they 
were rated as of intermediate value (a migra- 
tion toward the preceding stimulus). Preced- 
ing heavy by light weights also produced 
judgments toward an intermediate value. 
Woodrow (1933b) also employed an absolute 
judgment method and found that large in- 
creases in the magnitude of negative time 
errors with weights could be obtained by in- 
creasing the weight of the standard. Woodrow 
(1933b) followed his absolute judgment ex- 
periment by a carefully controlled study em- 
ploying successive comparisons. He used 
standards which were either fixed at 110 or 
200 grams or were varied randomly within 
the same range. A negative time error of 14% 
was found for each fixed set and as the aver- 
age of the variable set. However, an analysis 
within the variable series revealed a large 
central tendency effect: Positive time errors 
were obtained for the lightest weights (110- 
120 grams), zero errors for 130-140 grams, 
and very large (49%) negative time errors 
occurred at the high end (190-200 grams). 
Woodrow concluded (a) when stimuli are 
used in a series a central tendency effect oc- 
curs; (b) a sinking (fading) of the entire 
series occurs in the memory. When the stan- 
dard stimuli are fixed, there is no series effect, 
and only the sinking effect occurs. 

Series effects—auditory research. Needham 
(1935a) found a marked central tendency 
effect for sound intensity, which increased 
with lengthening of the memory interval. He 
found large negative time errors for 50-decibel 
and 60-decibel standards, while positive er- 
rors were obtained with 30-decibel and 40- 
decibel stimuli. In another study (1935b), he 


We 
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found that by preceding 30-decibel and 40- 
decibel standards with a 60-decibel stimulus 
he could exaggerate the positive time error 
normally obtained. This result is consistent 
With Fernberger's finding that standards are 
drawn in the direction of preceding stimuli. 
À strong central tendency effect using an 
absolute judgment method with sound in- 
tensity was obtained by Pratt (1933b). He 
repeated his experiment, but substituted a 
Successive comparison technique; the central 
tendency effect diminished. 

Koester and Schoenfeld (1946) reported 
an extremely large central tendency effect for 
pitch (positive around 1000 Hertz [Hz]; 
negative at 1120 Hz). The effect increased 
with the length of the memory interval (1 to 
6 seconds). The result is interesting in light 
of the total lack of time errors reported a 
year earlier by Koester for pitch and loud- 
ness based on 7,342 observations, Finally, 
Long (1937) found that an increase in sound 
intensity was created in the memory by pre- 
ceding a loud standard by a soft stimulus. 
This result conflicts with those of Fernberger 
y, who found the 
eding stimuli, 
al research, Mar- 


the size of the cards 
r cards increased 
error (an assimilation to- 
ward the background), while decreasing the 


card size decreased the magnitude of the 
positive time error, 


Freeman and Sharp 
Lauenstein found positi 
lines drawn on White ca 


compared to a 100-millimeter Standard. When 
the standard and Comparison stimuli were 
changed to light traces in à darkened room, a 
slight negative time error was obtained a 


(1941) reported that 
ve time errors when 
rds were successively 
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al 
result apparently attributable to the s 
of the background. This interpre 
supported by McClelland (1943), who timuli 
the size of the frame for comparison ^. 
and the 100-millimeter standard. ve. to 
an apparent assimilation of the stan 
the frame. ‘ect 
Tresselt (1944a) theorized that te N 
of background would be greater if th to the 
stimulus was similar (homogeneous) o 
background, He compared time ed squares 
black outlined squares in black pe lines i | 
(homogeneous condition) with black condi 
square black frames teer. of size 
tion); there were also two condition keroun 
of stimulus objects on the same bac ds time 
sizes. The hypothesis was supported ificant; 
errors for the black lines were not Ful sig- 
while the small black squares peace’ plack 
nificant negative time errors and larg 


ination 
ss 
with 
(1944a) found a similar contrast ack- 
very small squares on a very larg i 
ground. > rasoti 
Assimilation effects—kinesthetic :milatio" 
Guilford and Park (1931) found am 
of the standard to interpolated s: arch 
Assimilation effects—auditor y 32) 1 
Lauenstein, according to Pratt (193: conte” 
One of the earliest proponents of the nilatioP 
Position, arguing for a total ass! ghle” 
theory of time error. He replicated tica E 
click experiment and obtained we q jou 
sults; however, when he interpolate aris 
sound between the standard and e pe 
stimuli, a large positive time ace nee i 
Lauenstein argued that Köhler’s larg stan E. 
tive p-function occurs because the ee! E. 
üssimilates to the zero stimulus bai is wi 
standard and comparison stimulus. " hol — | 
results demonstrated that the princip 


p”: 
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if a background stimulus is presented. A back- 
round louder than the standard resulted in 
a positive time error while a softer back- 
Bround gave rise to a negative error. 

Pratt (1933a, 1933b, 1936) tested Lauen- 
Stein’s hypothesis in a series of studies. He 
interpolated either loud, soft, or no sound 
between the standard and comparison stimuli. 

Positive time error was obtained for the 
Sud interpolation and negative errors for the 
Soft and null interpolations; the time error 
Tom the soft interpolation was larger than 
or the null interpolation, a result supported 
Y similar findings already cited. Pratt argued, 
as had Woodrow, for a combined trace and 
assimilation theory where interpolated loud 
and soft stimuli produce an assimilation in 
Addition to a sinking effect. (When the time 
errors of the 20% louder and softer inter- 
Polated stimuli were averaged, a net negative 
Me error existed.) Only a sinking effect 
Occurs with a null interpolation. Needham 
(1934a, 1935b, 1935c) also argued for a com- 
"ned sinking trace-assimilation explanation. 
m found the effect of an interpolated stim- 

S wore off during longer memory intervals 
a an increasing negative p-function inevit- 

Y occurred. 
sini land (1958) advocated a revised as- 

‘lation position. He contended that traces 
E. hot sink; rather, they exist in "persistent 
Bue that are pulled toward | interpolated 
E nu, He tested his contention with an 
,Deriment in which there was either no in- 

TDolated stimulus or one with the same 
alue as the standard; he argued that Pratt 
Bd predict a greater sinking trace for the 
NE condition. Bowland found no DIU 
NS for either case, and maintained there- 
1 = that traces do not sink but only assimi- 

e. However, he is in the dangerous position 

accepting the null hypothesis to prove his 
Vide! moreover, his lengthy dissertation pro- 
cie an excellent argument for reviewing the 
k ature—he picked a memory time of 3 
Conds which is the exact point in Köhler’s 
te ction where no time error occurred. His 
best ts must be considered inconclusive at 


à Quantification of context effects—Helson’s 
te Ptation level theory. Helson (1947) ex- 
ded his adaptation level (AL) theory to 
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cover the effects of series, background, and 
interpolated stimuli. He theorized that suc- 
cessive comparisons take place within a frame- 
work where a neutral point or adaptation 
level exists. Context stimuli alter the adapta- 
tion level affecting the comparison according 
to the following expression: log (AL + K) = 
(3 log X;/z + log C)/4, where K is a con- 
stant depending on whether the standard or 
comparison stimuli are presented first, X; is 
the value of each stimulus judged, C is the 
value of the standard, and z is the number 
of stimuli in the series. Helson analyzed 
Pratt's and Fernberger's data and found that 
his equation described their results exactly. 
This formulation provides the interesting con- 
clusion that negative time errors may arise 
because subjects geometrically average stan- 
dard, comparison, and context stimuli, and 
as the geometric mean is usually beneath the 
arithmetic mean, the difference would appear 
as a negative time error. Helson's formulation 
does not include time as a parameter; there- 
fore, it covers only one point of the p-func- 
tion. It would probably not be too difficult to 
introduce the time variable into the equation 
to complete the analysis. 


Adaptation View 


The simplest explanation (Osgood, 1953) 
for time errors was considered in 1899 by 
Muller and Martin (according to Needham, 
1934a) and by Newhall et al. (1957). Sen- 
sory adaptation or fatigue from exposure to 
the standard could result in two possibilities: 
(a) if the sensory organ was desensitized and 
the comparison stimulus followed rapidly be- 
hind the standard, the comparison stimulus 
would appear too weak, producing a positive 
time error; (5) if the sensory organ had 
enough time to recover, the standard, which 
would be remembered near its adapted level, 
would appear weaker than the comparison 
stimulus, and a negative time error would 
result. This explanation would parallel the 
typical p-function first found by Kóhler but 
would be difficult to apply to nonintensive 
stimuli variations such as pitch or hue. Little 
data have been advanced to separate this 
alternative from trace or context effects ex- 
planations. 
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Other Factors 4 fecting Time Errors 


Practice, Kóhler (1923) found the positive 
Portion of the #-function to diminish with 
practice, Needham (1934b) found a diminu- 
üon in time errors with practice as well 
(1934a) as a tendency for the -function to 
become positive. Karlin (1953) confirmed 
Needham's results, finding 
the size of time errors with practice, 

Set. In 1899 


$-function found by Kühler 
Sharp, 1941; Payne & 
authors Suggested th: 


r and retina] potentials; 
h 


t has been expended to 
he periphera] Concomitants of 


rating of the Standards (which , 
Sented an equa] number of tim 
mean of the rating 
time error, 


es) and the 
scale yields a measure of 


color masses of the s 
had found, as Philip (19 
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in negative errors, When a successive Es 
parisons method was employed, however, 
central tendency effect disappeared. 931) 
Woodrow (1933b) and Fernberger Utd 
found overall negative time errors for dodi 
weights with an absolute judgment me eh: 
moreover, they found a central tendenon 
fect. Wever and Zener (1928) STR for 
absolute and relative judgment method PE 
lifted weights, They found a negative ee was 
tion for both methods; the time a e 
larger for the absolute method (6% wm oth- 
to 4% for the relative method). They ele o 
esized that with time the entire Ms the 
weights lifted in the past fades, shifti eative 
Subjective scale downward in the neg tak- 
time error direction. Needham ees hy- 
ing note of Wever and Zener's resu 1 abso- 
pothesized a shift from a relative to a long 
lute judgment comparison for a P ex- 
memory intervals in relative comparis 
eriments, ind 
: Stimulus fadeout. McClelland (1943) TR 
time errors were caused in part by the gr - 
of stimulus onset and offset. The slow xperi- 
off of projectors during a visual extent Ei. 
ment produced a negative time ero" iting 
Clelland noted that in most weight- dow” 
experiments, subjects put the weights teste 
rather than abruptly dropping them. me dards 
the effect of abruptly dropping the qe con- 
and found a slight positive time error (o all 
trasted with the negative time error simila" 
obtained). It was hypothesized that xperi* 
effects could be found in auditory € 
ments due to speaker fadeout. "o priatit!l: 
The effects 0f brain injury and incbr ^ the 
Weinstein (1955) reported research ^ time 
effects of brain injury and inebriation east ell 
errors. Among the research cited by We renee? 
Specht found inebriation drastically in timuli: 
negative time errors with auditory p ob- 
Similar results with visual stimuli an E 
tained by Lange and Specht. Large P'ction- 
time errors with disorders of sensory fu sack? 
ing were observed by Stein and beet 
(according to Weinstein), while W E ed 
found negative time errors which in s, wh? 
with longer memory times for person 
had lesions of the parietal lobe. nods of 
Psychophysical methods. The Es ent 
limits, constant stimuli, absolute judg 


-— 


COPA error have most frequently been 
A col etermine time errors. The researchers 
m. Sir have favored the average 
dm chnique, while the time error studies 
sting, USUally been based on the constant 
pn method. 
e ange (1947) sought to determine the 
com Oi using the average error method as 
E. I to the constant stimuli approach. 
E od Kohler’s study several times, 
bine d the constant stimuli technique; he 
Eaubon Kühler's results each time. When 
t Pain an average error method, he was 
only sli to find any negative time errors and 
an im ight positive time errors. Postman noted 
Er ua ame between the methods 
Method ing time, When the constant stimuli 
is employed, maintaining times nor- 


ma 
d are from 1 to 4 seconds. On the other 


ne d A 
Which Onsiders that the comparison stimulus, 
Match, 1s under adjustment throughout the 
» e Ch interval, acts basically as a long dura- 


on 
i Mterpolated stimulus. 


ai or Mrmory Time RESULTS 
v and Magnitude of Time Errors 
ti stdies involving the effects of memory 
based a be separated into two categories 
leq. on the type of stimulus dimension var- 
ligo, 2) Quantitative, where an intensity 
al B is varied that alters such phenome- 
tion tibutes as loudness, brightness, satura- 
| Varia d weight; and (5) qualitative, where 
— Suc lon of the stimulus dimension changes 
vig, Phenomenal attributes as hue, pitch, 
ig extent, and geometric configuration. 
a classification is similar to S. S. Stevens’ 
etat distinction between prothetic and 
Or ao continua. An attempt was made to 
Sibinig € results of studies involving the pos- 
ang Y Of time errors into these categories 
th, ccording to the predominant trend of 
no time errors, negative time 


Etro results: 

r: - e 

i 3, positive time errors, approximately 
Qua] aud 


Sviq Positive and negative time errors, 
ence that the time error depended on 


€o 
ext variables. The tabular results are pre- 
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TABLE 1 
TABULAR BREAKDOWN OF MEMORY TiME RESULTS 
Time error Color memory 
research research 
Predomi: - = : " 
on Stimulus dimension varied 
Quanti- | Quali- Quanti- | Quali- 
tative | tative | tative | tative 
No error 1 0 3 3 
Negative 12 4 2 0 
Positive 2 4 10 1 
Equal positive and 
negative 1 4 0 1 
p-function depends 
on context 17 9 — — 
"Total 33 21 15 5 


sented in Table 1. (A tally was made for 
each result, not just for each study; classifica- 
tions are sometimes difficult to make accord- 
ing to these crude categories.) 

We may conclude from Table 1 that the 
time error literature indicates the prevalence 
of a negative /-function where quantitative 
stimulus dimensions were varied; however, a 
strong interaction with context is obvious. 
This conclusion is in conflict with the general 
trend of the color memory research. No clear 
trend seems to exist for time errors with 
qualitative variations in stimuli; however, 
stimulus context appears to be equally im- 
portant for qualitative as for quantitative 


variation of stimuli. 


The Effect of Context 

1. Central tendency. Nine studies tested 
for a central tendency effect; of these, eight 
observed it. Models hypothesizing the migra- 


tion of memory traces toward a norm appear 


to be useful. 
2. Previous stimuli. Two of the three 


studies reporting the effect found an apparent 
assimilation of memory traces to the mean of 
stimuli presented prior to the standard. 

3. Background. Five studies were conducted 
to determine the effect of background. All re- 
vealed an apparent assimilation of the mem- 
ory trace to background stimuli. 

4, Interpolated stimuli. Of the nine studies 
investigating interpolation, eight indicated an 
apparent assimilation of the memory trace to 
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the interpolated stimulus; the dissenting 
study reported a contrast effect. Assimilation 
apparently occurs until the differences become 
So large that they trigger a contrast effect, 


Other Considerations 


1. Practice appears to reduce time errors. 
There was some indication that highly prac- 
ticed subjects may make significantly more 
Positive errors, 

2. There may be a peripheral component to 
time errors, Several studies reported muscle 


potentials correlated with time error p-func- 
tions, 


average error psycho- 
Large negative errors 
former, while a slightly 
as obtained with the 


ESEARCH ON MEMory Timme 
ESSIVE JUDGMENTS 


cessive j udgments: j 


1. Control of context stimuli, Many experi- 
menters haye neglected to control for context 


stimuli. This may be Particularly important 
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) J 
in color judgments. Little concern s | 
shown for background brightnesses a re 
geneity, stimulus intensity, or stimuli P 
ceding or following the standard. cons 
2. Adaptation and afterimages. Few ts 0 
trols have been introduced for the Ww 
Sensory adaptation or afterimages pu. 
1939). For short memory intervals the sdeatl 
of adaptation and afterimages would be + rise 
cal in color memory—both would Pearle 
to positive time errors. As noted error: 
adaptation might lead to negative pups olate 
for longer memory times. Strong A uce 
or preceding series stimuli probably Po l 
Some adaptation and moderate c: uer 
resulting in an apparent assimilation 
the intervening stimuli. robably 
The color memory results were P rimage 
confounded with adaptation and pee 
effects. Presentation times (for the pe. 3); 
tended to be lengthy (up to 15 pon and 
probably producing moderate pag qi aver- 
vivid afterimages, Moreover, use p a colo 
age error technique, by maintaining "yy, 
stimuli during the memory interval, fterima£ 
edly led to large adaptation and a ptained 
effects; the large positive p-functions © r these 
by the color memory researchers bolste 
contentions, irradiatio? 
Sperling (1960) noted that Dy charac 
afterimages might alter nonintensive ; 
teristics such as object size. eni paid 
3. Practice, Scant attention has be or 
to practice effects by either color ae errot? 
time error researchers. The small d e the 
reported by many investigators migh Y 
result of practice, sic 
4. doae factors. (a) Psy chop by 
method. Cited earlier were data le be 
Postman that indicated large diete y con” 
tween the methods of average error majority 
Stant stimuli techniques. Since the e 0 the 
of color memory research involved uS ig 


r H 
à e time er ji 
average error technique while time im" 


st 


Searchers employed the constant, ri sul 
Method, the apparently discrepan ere? 
QUSE in part be attributed to this differ ts 
Siegal (1962) found the constant “es 


method to be the most sensitive d p? 
discrimination threshold values) of we jf 
chophysica] techniques. Moreover; be cep! 
noted the average error technique is 


i 
S —' 


ble to contamination from stimulus adapta- 
lion and afterimages. Large assimilation ef- 
fects (toward the origin of the series) have 
een reported from the few studies using the 
Method of limits. The constant stimuli 
method, therefore, appears best for time error 
research, 
(b) Calculation of time errors. Several 
Methods exist for calculating time errors for 
the constant stimuli techniques: The simple 
r% method, freehand and sliding average 
Ogive fits, normalized straight line fits, as well 
as weighted and unweighted normalized least 
Squares fits, can be utilized (Guilford, 1954). 
he time error and color memory studies have 
Used many of these techniques without con- 
cern; it is inconceivable that no differences 
exist among them; moreover, the results may 
not be comparable to the time errors obtained 
through average error or limits methods. 
(c) Individual and sex differences—the 
Consequences of averaging. Little attention 
a been focused upon the question of in- 
E dua or sex differences. Woodrow (1933b) 
Ported an even split for subjects: half had 
pats p-functions and half, negative ?-func- 
p No evidence has been presented to 
OW the sexes are equivalent. With these 
questions unresolved, averaging has obvious 
angers, Techniques devised for the practical 
Problem of reducing time errors may not hold 
ìf based upon averages; individual selection 
d training may be more fruitful for this 
Practica] purpose. 
g (d) Statistical evaluation. Little of the re- 
earch on color memory or time error has been 
auiected to the scrutiny of statistical evalu- 
lon; usually, only mean time errors have 
Sen reported. There is actually little evidence 
9 discredit the argument that each body of 
idence is actually the product of guessing 
Tom subjects biased toward a particular re- 
t. Only the frequency of similar conclu- 
ns leads us to a contrary belief. 
(e) Sample size. Most of the studies cited 
p utilized very few subjects (which is 
ha Pably the reason few statistical analyses 
Ve been attempted). Hamui and Landis 
(1955) used two subjects; Newhall et al. 
1957), three; Payne and Davis (1940), only 
ne, Bowland (1958) used 26 subjects, but 
Bave each only three presentations of a spe- 


sio 
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cific stimulus. Sample sizes as small as these 
are hardly reliable enough to make the broad 
generalizations typically made by the investi- 
gators. 

(f) Weber's law. Except for Helson's treat- 
ment, there has been a total disregard of the 
relativity of stimulus change quantified by 
Weber's law. Changes in low-intensity stimuli 
have been treated equally to changes in stim- 
uli of high intensity. We must relate the mag- 
nitude of a stimulus change to the absolute 
stimulus level. “To dismiss this law because it 
does not hold over the entire range would be 
to obscure a fundamental fact about sensory 
mechanisms [Osgood, 1953, p. 76]." Aver- 
aging large and small stimuli, divided by their 
absolute magnitude, may no longer result in 
an overall negative time error as has been 
previously shown. Many of the central tend- 
ency effects are undoubtedly contaminated 
with this error. 

(g) Guessing and response bias. Little at- 
tention has been paid to the problem of con- 
tamination from guessing and subject bias. If 
a subject cannot sense differences among the 
comparison stimulus alternatives and is forced 
to guess, his -function will appear as a 
horizontal line centering close to a zero time 
error. Response biases will move the p-func- 
tion markedly above or below zero. It is diffi- 
cult to estimate what proportion of the color 
memory and time error results are confounded 
by guessing and bias, but it is interesting to 
note the tendency for the p-function to go to 
zero at long intervals where subjects might 
be expected to rely most heavily on guessing 
(Karlin, Lehmann, Wolfe, Whipple, Wada; 
in Needham, 1934b). 

5. Fatigue. Bowland (1958) noted that 
experimenters have typically given little con- 
sideration to the length of their experiments; 
time error studies are extremely fatiguing. 
Fatigued subjects cannot be expected to pro- 
vide adequate data. 

6. Fixed versus variable standards. In the 
majority of the studies, a single fixed standard 
has been repeatedly presented. Tn real life, 
observers are often called upon to make com- 
parisons with standards that are different 
from one judgment to another. In light of 
practice and series effects, it is probable that 
large differences exist between judgments 
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made with fixed standards and those made 
with variable standards. 

7. Stimulus fadeout. As McClelland (1943) 
found, large time error differences can result 
by varying the rapidity of removing the stan- 
dard. It is reasonable to Speculate that both 
onset and offset of both standard and com- 
parison stimuli may have considerable effect 
on time errors. These variables have been 
largely uncontrolled, 


Movet Srupv or SATURATION MEMORY 


In a series of carefully controlled studies 
of saturation memory for a single 538-milli- 
micron stimulus, Springer and Tate: at- 
tempted to take these problems into account. 
A constant stimuli 
with memory intervals of 1, 4, 14, or 30 sec- 
Onds. There were se 
in which the degre 
variability, 


or differences amon 
analysis of variance) 


values ranged from 99%, 
decided to reject all subj 
ence limen of 16% 
in 16 of 60 
Procedure js 
interested in 
able to make 
An analysis o 


to 1296, and it was 


carded. This 


asonable if we are 
saturation m 


good satura 
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y 
positive time error at the 1-second eu 
interval, which changed to zero at 5 B. the 
and became more and more negative a up- 
15- and 30-second intervals. This indinga 
ports the results of previous pus but 
studies on quantitatively varied pe. Tol 
conflicts with color memory experimen mory 
sults. Both the 15- and 30-second me zero 
groups were significantly different ar well 
(ie., significant time errors). Not ng ad no 
the analyses significant where they a — 
been before, but differences in depu ques 
sulting from different curve-fitting tec uncor- 
were less for the corrected than the 
rected data. 7 ere that 

Other findings of the studies me error 
practice with feedback reduced the Metodi 
to insignificant amounts for all ae gave 
the individual “reliable” subjects, tice. 
more positive time errors with pre cantly 
was also found that males had edt in 
more positive time errors than fema ther E 
of 16 experimental conditions. p mor! 
difference was that women tended to while 
accurate at the shorter memory pem in- 
males were more accurate at the lona 
tervals used. The fact that in the ga ive 
data more subjects tended toward P^. ed 
time errors may be a clue to esc et oF 
in studies by Karlin (1953) and o ction 3t 
time errors to shift in a positive ae in- 
longer memory intervals. At longer and te 
tervals, one would expect guessing henc? 
sponse biases to be more prevalent an 
beget unreliable data. 


CONCLUSION Foot 
ee 

The Springer and Tate study d é 

note 3) clearly demonstrates the n edi th 

taking into account factors mention 


: P r stu t 
Physical methodology than just for ren 


ʻo appa 

errors in successive judgments. It 1 a d E 
that the exercise of careful contr 1 matt 
number of variables is a very ato P 
in any study involving a psychophy are v. 
Cedure; the fact that such proscar psy bo 
of the oldest branch of experi = nav 
°8y methodology does not mean tha 
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Yet learned, much less practiced, all the refine- 
ments desirable in these techniques. Our 
analyses of psychophysical data have also 
left much to be desired. Certainly no study 
Should ignore, as they usually do, consider- 
ations of the effects of individuals and sex 
differences and the inclusion of poor discrimi- 
hators in the data; such data can give con- 
fusing and misleading results. 
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COMMENT ON ALTRUISM: 
THE COMPARATIVE EVIDENCE 


D. O. HEBB '! 
McGill University 


Krebs’ dismissal of the evidence of altruism in infrahuman mammals is criti- 
cized. It is proposed that this is in fact the only decisive evidence, in view of 
the difficulty of interpreting the human data. 


Krebs (1970) has obscured the picture of 
altruism. He dismisses the comparative evi- 
€nce as mostly anecdotal, which it is not, 
and though he agrees that one animal may 
Sive help to another he feels that no conclu- 
inus are warranted because “the reasons 
ehind [such behavior] are almost as ambigu- 
Us as those behind the behavior of humans 
T». 262].” Almost, but not quite: The fact 
d „that the only decisive evidence of the 
Xistence of altruism, as such, comes from 
€ large-brained infrahuman mammals. 
pur he primary problem is whether there is 
oon behavior in which one animal helps 
Wealth. at some cost to himself (in material 
, such as food, or in the expenditure 
ing, TEY), when the behavior has not been 
luced by reward or by threat of punish- 
ant. Superficially, such behavior is common 
on man, but no conclusive argument based 
the man data has been possible because of 
th Complexity of human experience and all 
* situations in the past in which the child 
bo been rewarded for giving or scolded for 
in Blving, Here the animal data are decisive 
op towing that altruism can occur. Nothing 
er animals do proves anything about man, 
titi such evidence does refute the principal 
ar icism of the human data: that doctrinaire 
or ment, that altruism in the true sense can- 
(ba exist, that all motivation must be selfish 
heeds) ultimately on primary biological 
tpe idea that the evidence of Hebb and 
We Mpson (1954) was anecdotal is absurd. 
D referred to the behavior of a dog that 


is) . : * á 

Vented children from swimming, by per- 

1 

th Requests for reprints should be sent to the au- 

M ; Department of Psychology, McGill University, 
Teal 101 Quebec, Canada. 


sistently “rescuing” them despite never having 
been taught anything of the kind and despite 
repeated punishment, as an example of the 
well-known guarding and protective behavior 
often seen in dogs, and Krebs perhaps con- 
siders this to be anecdotal. Our main evidence, 
however, was experimental. We quoted Nissen 
on the experiments of Nissen and Crawford 
(1936) and reported some results from an 
experiment by Hebb (1949). The latter pro- 
vided a clear example of concern by one adult 
chimpanzee for a friend's welfare, shown by 
attempting to pull her away from a place 
of risk. 

Krebs further implies that unsolicited help 
or food giving is rare in the chimpanzee, and 
that if the help has been solicited, it is not 
altruism. The first is not so, and the second 
is hard to understand. Nissen and Crawford 
reported a number of examples of unsolicited 
food giving between the young animals of 
their experiment, and it is also recorded in 
the diaries of adult animals of the Yerkes 
Laboratories of Primate Biology. On the 
second point, is it not generosity to give food 
when requested? It may be more generous to 
anticipate another's need (as Nissen and 
Crawford’s half-grown animals in fact did), 
but giving without return is altruism, whether 
induced by begging or not. Krebs emphasizes 
also that the animals gave the less attractive 
food; but this, instead of casting doubt on 
the existence of altruism, just makes the case 
more credible. 

Finally, it appears that some of the food 
giving was imitative: “a result of modeling 
[Krebs, 1970, p. 262]." 'This suggests one 
source of altruism, but altruism does not 
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become less altruistic when we understand 
how it comes about. In my own opinion, em- 
pathy is involved (for evidence of chimpan- 
zee empathy, see Kóhler, 1927, Plate IV), as 
it is in imitation, so it is quite reasonable to 
find an interaction of altruism and imitation. 
It is clear, however, from Nissen and Craw- 
ford's data and from my own experimental 
observations (referred to above) that altru- 
istic behavior can occur in the absence of 
model to imitate. 

As I have said, finding that Something is 
true of other animals does not prove it true of 
man; but if there are suggestions of altruism 
in the dog, and clear evidence of it in larger- 
brained animals such as chimpanzee and 
porpoise (Kellogg, 1961: Siebenaler & Cald- 
well, 1956), it suggests at least that we might 
look for it in another large-brained animal— 
man. In fact, it is well known in man; but 
it has traditionally been explained away as a 
product of childhood training on the basis of 
the classic assumption that the only unlearned 
motivations must be selfish. The comparative 
evidence is crucial in one respect: It shows 
that altruism may arise in the chimpanzee, in 


any 


0 
some modest degree, where there has bea 
training in generosity. On any reasonable eo i 
this requires a reinterpretation of AE 
tional hedonistic, law-of-effect view of 
nature and human motivation. 
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INFRAHUMAN ALTRUISM 


DENNIS KREBS: 


Harvard 


Hebb's critique of Krebs’ (1970) 
is examined. It is suggested that 
evidence for helping 


assumption that the helpi 
lacking. It is concluded that evidence 


gone furthest to explain altruism. 


D. O. Hebb's (1971) critique of Krebs’ 
(1970) review of the comparative evidence 
Or altruism raised a number of issues that 
Warrant examination. Hebb addressed the 
Problem that traditionally has been most 
Closely associated with the study of altruism— 
pe Droblem of its existence. Hebb argued 

at the conclusion reached by Krebs 
(1970) —that comparative research has mot 

Stablished the existence of altruism in infra- 
phe was misleading. (It should be em- 

asized that Krebs, 1970, did not dismiss 

evidence.) Hebb went on to argue that 
ae only decisive evidence for the existence of 
aN uism comes from research on large-brained 
tahuman mammals. The evidence adduced 
y Hebb, however, was mostly descriptive. 


Without exception, it failed to explore the 
ingly altruistic 


potivational bases of the seem! 
havior it described. In fact, recent research 
e humans—research concerned with the ef- 
i of empathic reactions on altruistic behav- 
—has gone furthest to explain altruism.” 
sho efore discussing altruism, its criteria 
i be clearly specified. The criteria em- 
S ved by Hebb require that the motivation 
(H the behavior in question be “intrinsic 
ha ebb & Thompson, 1954) and that the be- 
vior not be “induced by reward or by threat 
Punishment [Hebb, 1971, P. 409].” 
trinsic seems to imply innate. However, 
ai b and Thompson (1954, P- 542) enter- 
ned the notion that altruism might be a 
be sent to the au- 
Relations, William 
Cambridge, Massa- 
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experienced affect and 
Harvard 


behavior in infrahumans, 
ng responses were based on altruistic motivation is 


University 


review of infrahuman research on altruism 
although there is a good deal of descriptive 


experimental support for the 


from human research on empathy has 


"learned drive." But inasmuch as learned 
drives are a function of reinforcement (Miller, 
1951), the first criterion is inconsistent with 
the second. 

The separation of learned and innate com- 
ponents of behavior has presented a classic 
problem for psychologists. Although the study 
of infrahuman animals has supplied one way 
of researching the problem, none of the studies 
reported by Hebb (1971) addressed the issue. 
Hebb and Thompson (1954) seemed to sug- 
gest that the phyletic increase in aiding be- 
havior is due to a corresponding increase in 
an innate predisposition toward altruism. The 
increase in helping, however, could be due to 
any of a number of the differences among the 
animals in question, especially differences in 
learning experiences. 

Hebb's second criterion forms the basis of 
his argument, but, as pointed out earlier 
(Krebs, 1970, p. 260), the criterion is overly 
restrictive. It is logically impossible to estab- 
lish the absence of reinforcement, because it 
entails proof of the null hypothesis. Even 
though infrahumans are not exposed to formal 
socialization practices, the conclusion that 
their helping behavior is not a function of 
reinforcement is not warranted. It seems quite 
likely that both humans and infrahumans re- 
ceive rewards for helping and punishments for 
not helping, quite independent of formal 
socialization. 

Several points need to be made about the 
evidence adduced by Hebb in support of his 
conclusion that infrahumans behave altruisti- 
cally. To begin with, Hebb and Thompson 
(1954) recognized the deficiencies of anec- 
dotal evidence: “There is a great deal of 
anecdotal material to suggest the existence of 


altruism and cooperation in primate societies 
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- - - v but the evidence in general has been 
so poor that Zuckerman (1932) had to con- 
clude that no altruism exists . . . [p. 541].” 
Hebb and Thompson (1954) then went on 
to present what they described as “experi- 
mental” support for the suggestion that chim- 
panzees and porpoises behave altruistically. 
None of the studies, however, supplied experi- 
mental support for the existence of altruism, 
if experimental means the manipulation of 
independent variables and the measurement 
of their effect on dependent variables and 
altruism means nonreinforced aiding re- 
Sponses. Rather, the evidence was of a de- 
scriptive nature; it was based on controlled 
observation of laboratory animals. The only 
study that focused on a dependent variable 
relevant to the understanding of altruism 
(Nissen & Crawford, 1936) was careful not 
to extend its conclusions beyond what it 
examined—food sharing in chimpanzees, 
Turning to the studies in question, although 
the protective behavior of the chimpanzee 
reported by Hebb (1949) was observed in an 
experimental Setting, the experiment was 
actually concerned with the reaction of chim- 
panzees to threatening and nonthreatening 
stimuli. It occurred in a chimpanzee during 


an aggressive exchange between her partner 
and a provocative experimenter, 
was made to 


at observed by 
9) seems most appropriately ana. 
erative because ; 


in which two 


or more parties help one another reach a goal 


beneficial to both. 

Behavior similar to the protective responses 
of Hebb's chimpanzee is common in hu. 
mans. Campbell (1965) reviewed research on 
ingroup solidarity in the face of outgroup 
aggression. Krebs (1970) treated Campbell’s 
(1965) review as “cogent” support for the 
notion that altruism is not inconsistent with 
the theory of evolution, 

Studies mentioned by Hebb and Thomp- 
son (1954) Presented descriptions of the 
behavior of Captive porpoises, Although 
Some of the behavior Seemed altruistic, no 
tests were made of the Motivational ante- 


eil 
cedents of the acts. A further study 0M 
to by Hebb, 1971 (Siebenaler & Author 
1956) was experimental, but, by its eration 
description, it was concerned with coop | 
among porpoises. 936 
A final study (Nissen & Cromford, a 
supplied unequivocal evidence X di 
zees share food with one another. ee 
however, show that food sharing b food 
on altruistic motivations. The WR termi- 
sharing might have been directed at in mu 
nation of a noxious begging response, 
the same way as the bar-press m 
the Lavery and Foley (1963) rats : 
1970, p. 261). To quote Nissen and 
(1936): 


Responsiveness to begging may be © ss 
probably related to reactions given 1n which s¢ ing 
for help and appeals for sympathy, lus of begg od 
of the presumably unpleasant stimu with delay! 
by the partner, or (2) an exchange 

reward . . . [p. 416]. 


d not, 
ase 


c 
rio] 


(1 ) se | 
res 


9r previously received reward relative 
(And unsolicited food sharing a nly 2997. 
rare. Of all the food available, wis past } 
3% was thrown, and about 946 recip’? o 
and almost all of it was passed on E food & 
basis—chimpanzees, in effect, trac Nisse” 
a “ ‘credit? or deferred basis”—S€° ini 
Crawford, 1936, p. 415.) that 7? ive 
To summarize, it is argued mn ec 
human research has failed to sunt puis pe 
evidence for the existence of * ieruistic af 
though incidences of seemingly 2 oises “et 
havior in chimpanzees and eine it mest | 
dogs and rats) have been song d ind hind | 
be concluded that “the reasons hose | 
acts are almost as ambiguous as t 70: 


t 
0 
2 0) 
the behavior of humans [eee qe 
262].” The fact that ee that pet 
formally socialized does not mean 5j j 


i e 
a ded : rositY- , pm 
receive “no training in genet d punish 
are many sources of reward an 
besides socializing agents. 


REPLY TO HEBB 


Perhaps too much stress has been put on 
the deficiencies of past studies. What type 
0f evidence is needed to establish the existence 
Of altruistic behavior? Hebb (1971) sug- 
gested two sources of altruism—imitation and 
empathy. The effect of modeling on altru- 
istic responses has been discussed elsewhere 
(Krebs, 1970, pp. 267-276). The effect of 
empathy warrants further attention. 

Research on empathy supplies a way of ex- 
Perimentally supporting the existence of altru- 
ism and helping to explain it. Several studies 
on rhesus monkeys have reported empathic- 
like responses, but a recent study on humans 
has supplied the strongest support for the 
notion that empathic reactions mediate altru- 
istic responses. Mirsky, Miller, and Murphy 
(1958) found that rhesus monkeys who ob- 
Served other monkeys emit fear responses per- 
formed a previously extinguished avoidance 
response, Miller, Banks, and Ogawa (1963) 
found that signs of anticipatory fear in one 
of a pair of monkeys served as a cue to his 
Partner to press a lever to defer a shock to 

oth. And, in a similar situation, Miller, Caul, 
and Mirsky (1967) found that normally 
Teared (but not socially isolated) monkeys 
Were able to learn to avoid shock by watch- 
Mg the facial expression of a normal (but not 
Socially isolated) partner. Of considerable in- 
terest was the finding that the avoidance re- 
Sponses of the normal monkeys were accompa- 
nied by a psychophysiological sign of emo- 
tional arousal—cardiac acceleration. It is 
Possible that the monkeys reacted to the in- 
lormation that they might receive a shock 
Tather than to the distress of their partner. 

Toreover, the behavior of the monkeys was 
More cooperative than altruistic. However, a 
further study found quite clear evidence that 
distress cues are noxious stimuli to rhesus 
Monkeys. Masserman, Wechkin, and Terris 
(1964) found that monkeys consistently re- 
frained from pulling a chain that brought 
them food in order to avoid giving a shock 
9 their partners. Even though the monkeys in 
the Masserman et al. (1964) study could get 
ood on alternate trials by pulling another 
Chain, the evidence suggests that monkeys 
"act empathically to signs of distress in their 
ellows and that their empathic reactions 
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mediate altruistic responses. The evidence at 
the human level, however, is much stronger. 

It was suggested earlier that the criterion 
of altruism need not exclude reinforcement. 
In particular, it is thought that empathically 
or vicariously experienced reactions supply re- 
inforcement for behavior that can be consid- 
ered altruistic. The notion is elaborated else- 
where (see Footnote 2). In short, it suggests 
that if observers experience the joys and 
sorrows of others vicariously, then in some 
cases it may be more rewarding for them to 
give goods to another (and experience his 
pleasure vicariously) than to keep the goods 
(and experience a lesser pleasure directly). 
In this conception the problem of proving the 
absence of reinforcement is avoided, and the 
power of hedonism is brought to bear on an 
essentially nonhedonistic response. The con- 
ception is similar to that of Aronfreed (1968, 
pp. 138-149). 

Three experiments support the notion that 
empathic reactions can mediate altruistic re- 
sponses. Aronfreed and Paskel (reviewed by 
Aronfreed, 1968) and Aderman and Berkowitz 
(1969) created situations designed to induce 
empathy and found that subjects exposed to 
these situations displayed a disposition to 
sacrifice goods and perform helping responses. 
The empathic reactions of the altruistic sub- 
jects, however, were never checked—the in- 
vestigators only assumed that their manipula- 
tions produced empathic responses. A recent 
study (see Footnote 2) attempted to induce 
and measure positive and negative empathic 
responses and to measure their effect on altru- 
istic behavior. Psychophysiological reactions 
of observers who watched a performer appear 
to receive rewards and punishments were 
measured; then the observers were given a 
chance to help the performer. It was found 
that subjects who were led to believe that 
they were similar to the performer (past 
studies, e.g.. Stotland, 1969, have shown that 
perception of similarity increases the dispo- 
sition to empathize) evidenced the greatest 
psychophysiological reactions to his plight (as 
measured by changes in heart rate, blood 
pulse volume, and skin resistance). They also 
reported experiencing most empathy, and they 
behaved most altruistically. The results were 
interpreted as supplying experimental support 
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for the notion that humans vicariously ex- 
perience the reactions of similar others and 
that these vicarious reactions supply rein- 
forcement for altruistic behavior. It should be 
clear that even though the altruism in ques- 
tion is a function of reinforcement, the results 
do not imply a hedonistic self-oriented view of 
human nature or human motivation. If any- 
thing, they suggest a revision of the material- 
istic conception of human nature based on 


conceptions of reinforcement popularized by 
infrahuman research. 


gued that descriptive 
ent basis for conclu- 
in infrahumans or hu- 
dies reviewed by Hebb 
) established the exis- 
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PARTIAL REINFORCEMENT: 
A SELECTIVE REVIEW OF THE ALLEYWAY LITERATURE SINCE 1960* 


DONALD ROBBINS 2 
The Rockejeller University 


Studies using rats in an alleyway are reviewed with regard to the partial 
reinforcement acquisition and extinction effects. The magnitude of these effects 


is complicated by 


the fact that certain variables have differential effects de- 


pending on whether the rat is given a partial or a continuous reinforcement 
schedule. Three major theoretical positions, dissonance, frustration, and habitu- 


ation of comp! 


eting responses, are shown to encounter difficulties with a con- 


siderable body of data. It is suggested that extensions and modifications of 
Capaldi's sequential theory, which handled most aspects of the extinction effect, 
may prove to be a viable theoretical alternative. 


. Since the publication of Lewis (1960) 
literature review, updating the Jenkins and 
Stanley (1950) review, there have been an 
extremely large number of studies investi- 
Sating the effects of partial reinforcement. 
his review is even more selective than that 
of Lewis in that only alleyway studies with 
Tats, using positive reward, are dealt with. 
he review is divided into three sections: 
(@) a literature review of the partial rein- 
forcement extinction effect (PREE); (5) a 
teview of the data on the acquisition effect 
(PRAE); and, finally, (c) an evaluation of 
theoretical proposals. Because of the volumi- 
Nous literature, references are made to review 
and summary articles whenever possible. Oc- 
Casionally, references are made to articles 
Published prior to 1960, either because they 
Were published too late to be included in 
ewis’ review or because of their historical 
Significance. Tt should be noted that the de- 
Pendent variable in these studies is almost 
always a speed measure (e.g, start, alley, 
goal speed), although trials to criterion is 
Occasionally used. Furthermore, unlike Lewis’ 
(1960), the present review does not discuss 
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the effects of intertrial interval separately, 
but rather, when pertinent, in relation to 
other variables. 


EXTINCTION EFFECT 


The basic reference is Weinstock (1954, 
1958), who gave different groups of rats one 
trial per day, under a 17%, 33%, 50%, 67%, 
83%, or 100% reinforcement schedule. He 
found an inverse relation between resistance 
to extinction and percentage of reinforced 
trials during training. 

Amount of reinforcement. A number of 
investigators have reported a larger PREE for 
groups receiving large, in contrast to small, 
amounts of reinforcement during training 
(Gonzalez, Roberts, & Bitterman, 1966; 
Hulse, 1958; Padilla, 1967; Roberts, 1969; 
Wagner, 1961), although others have reported 
no difference in the magnitude of the effect 
(Amsel, Surridge, & Hug, 1969). These dif- 
ferences are complicated by findings (as 
reviewed by Sperling, 1965a, 1965b) that 
resistance to extinction and amount of rein- 
forcement are inversely related for groups re- 
ceiving continuous reinforcement but, for par- 
tially reinforced subjects, are either positively 
related (Hulse, 1958; Wagner, 1961) or not 
related (Brooks & Dufort, 1967; Roberts, 
1969). The greater magnitude of the PREE 
for larger amounts of reinforcement appar- 
ently results from a combination of a de- 
crease in resistance to extinction for continu- 
ously reinforced groups and either an increase 
or no effect on partially reinforced subjects. 
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The effect on continuous subjects appears to 
be greater than the effect on partial subjects. 
A further complication occurs when the re- 
sults of a recent study by McCain (19702) 
are considered. He gave four groups of rats 
a 50% reinforcement schedule. The results 
revealed more resistance to extinction for the 
group that received a large amount of rein- 
forcement in contrast to one receiving a small 
amount after 42 acquisition trials. However, 
this difference disappeared for two comparable 
groups that had received 138 acquisition 
trials. 

Number of acquisition trials. Most investi- 
gators report a positive relation 
resistance to extinction after partial reinforce- 
ment and total number of training trials 
(Bacon, 1962; Brooks & Dufort, 1967; Hill 
& Spear, 1963; Lewis & Cotton, 1959; Wag- 
ner, 1961; Wilson, 1964), M. B. Weinstock ? 
found that partial subjects after 60 acquisition 
trials were more resistant to extinction than a 
partial group receiving 30 trials, while the re- 


verse relationship was found for continuously 
reinforced groups. However 

Sperling (1965a, 
tion and numbe 


between 


Mikulka 


3 and Pavlik 
an incre 


ased magnitude in 
number of hours of food 
ased. The result appears 


> M. A factorial study of some 
Variables affecting resistance to extinction under 
peel reinforcement with Spaced trials, Unpublished 


ater. 

PRAE was found) then switched bea ai 
deprivation (the PRAE cee riva- 
demonstrated a PREE under water vs 
tion, perhaps due to es ome (1910) 
Haas, Shessel, Willner, and Rescorla traine 
still found the PREE with groups s an 
under standard deprivation condition 
extinguished while satiated. ule. 

Effects of shifts in reinforcement Ww 
Theios (1962) found no difference i ceiving 
ance to extinction between groups Ie initia 
continuously reinforced trials after ceiving 
training under partial and a group th groups 
partial reinforcement throughout. Bn con- 
were more resistant to extinction hotte a” 
tinuously reinforced subjects. Ras m an 
Surridge (1969) and Harris, Kd that the 
Weinstock (1962) have also foun subjects 
PREE is maintained even though force 
were given a block of continuously utherland: 
trials after partial training. firmed 
Mackintosh, and Wolfe (1965) con a grouP 
latter result and also found that us, rel 
receiving partial, followed by continuo a i 
forcement was more resistant to beet 
than one receiving continuous fO jal in- 
partial, with subjects receiving gee t 
forcement throughout being most Tes! 
Theios and McGinnis (1967) i5 
Sutherland et al. (1965) for pcd this be 
for different acquisition levels. ko repo E 
done, Theios and McGinnis (196 ), resist 
that although both groups were pem su 
to extinction than continuously rein tial group 
jects, the conhinuousshifted topa i a 
Was more resistant than the partia! m Jens? 
continuous group. However, Leung 24 a 
(1968) found that subjects swi che 
a partial to a continuous reinforcer on A 
ule were more resistant to ete oe : 
both a continuously reinforced AD part ne 
groups switched from continuous from Kk 
With the latter groups not i-o E. 
Continuous subjects. Thus, alth REE od 
flicts exist in the literature, à rein fry, 
obtained whether continuously rtial © i 
trials are given before or after a an prese” te 
ule. However, the order of schedis ari? id 
tion is apparently an importan obtaf? 
regarding the magnitude of the 
extinction effects. view 

Reextinction. Some investigators 
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cessive reacquisitions and reextinctions as a 
Special case of partial reinforcement (Jensen 
& Cotton, 1960). Generally, reextinction is 
Slower than original extinction (Cotton, Hill, 
& Clayton, 1967; Jensen & Cotton, 1960), 
and sometimes faster asymptotic speeds are 
Obtained (Cotton et al, 1967). However, 
Capaldi, Leonard, and Ksir (1968) found no 
Change or a decrease in extinction rate for 
Spaced trials and an increase for massed trial 
Sroups for successive extinctions. 

Effects of initial nonreinforcement trials. A 
number of investigators have shown that sub- 
Jects given nonreinforced trials prior to 100% 
reinforced acquisition are more resistant to 
extinction than 100% reinforced groups, a 
PREE-like effect (Robbins, Chait, & Wein- 
Stock, 1968; Spear & Spitzner, 1967; Spear, 
Hill, & O'Sullivan, 1965). Recently, Capaldi 
and Waters (1970) found that a group re 
Ceiving 5 nonreinforced trials followed by 5 
reinforced trials was more resistant to extinc- 
tion than continuous groups receiving 5 or 

O acquisition trials. However, this effect is 
Usually smaller than the PREE so that these 
Subjects are generally less resistant than 
Partial subjects. 

Studies using small numbers of acquisition 
trials, Recently, there have been a large 
number of studies in which relatively small 
numbers of acquisition trials have been fol- 
Owed by extinction trials. Most of these 
Studies have allowed subjects to explore the 
Apparatus or one similar prior to training and 
lave used multiple numbers of food pellets 
Or wet mash as reinforcement. These studies 
have all shown a PREE (see McCain, 1966, 
for prior references; McCain & Brown, 1967; 

adilla, 1967). In addition, some investigators 
Aave given the subjects some small number 

continuously reinforced trials followed by 

à partial schedule (of a few trials) and have 
also shown a PREE (Bowen & McCain, 
1967; McCain, 1966). When prior explora- 
tion of the apparatus is eliminated, the PREE 
aS been found by some (Amsel, Hug, & 
Surridge, 1968; Capaldi & Deutsch, 1967; 
apaldi, Lanier, & Godbout, 1968; Capaldi 
Waters, 1970), but not by Surridge, 
Rashotte, and Amsel (1967). This discrep- 
ancy may be because Surridge et al. ( 1967) 
Sed a 90-second detention period on both 
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reinforced and nonreinforced trials, while the 
other studies have used considerably smaller 
detention periods. Further, with regard to 
exploration, Godbout, Ziff, and Capaldi 
(1968) gave groups of rats nonreinforced or 
reinforced placements in the goal box or a 
neutral box prior to training and found the 
PREE and no effect due to the various expo- 
sure conditions. When a single pellet has been 
given as the reinforcement, along with a small 
number of acquisition trials, the PREE has 
been found with a 15-second intertrial interval 
(McCain, 1968) and a 15-minute intertrial 
interval (McCain, 1969). In contrast, Amsel 
et al. (1968) failed to find a PREE with trials 
spaced at one per day for their single-pellet 
reward group. Thus, a PREE is usually found 
even when only one or two nonreinforced 
trials are given during acquisition, although 
the intertrial interval appears to be a major 
factor in these studies. 

Nonrcinjorced goal box confinement dura- 
Lion. Hulse (1958) found that a change in 
nonreinforced goal box confinement duration 
from acquisition to extinction reduced resist- 
ance to extinction; otherwise, no effect due to 
confinement was found. The former result has 
been replicated a number of times (Capaldi, 
1966b; Capaldi & Poynor, 1965; Stanley & 
Clayton, 1955), although with regard to con- 
finement time per se, some investigators 
report an inverse relation between non- 
reinforced confinement time and resistance 
to extinction (Capaldi, 1966b; Capaldi & 
Poynor, 1965). M. B. Weinstock (see Foot- 
note 3) found greater resistance to extinction 
for a 10-second goal box confinement time 
in extinction in contrast to a 30-second, and 
no effect due to nonreinforced confinement 
time during acquisition. The former effect was 
larger for the continuous groups than the 
partial groups. 

Goal box size. Both Kirkpatrick, Pavlik, 
and Reynolds (1964) and Bradley and Wong 
(1969) have found an increased PREE with 
a large, in contrast to a small, goal box. This 
effect seems to be largely attributable to 
increased resistance to extinction for partials 
for the larger goal box, with little or no effect 
being observed on continuously reinforced 
subjects. Support for this suggestion can be 
found in the failure to find a difference in 
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resistance to extinction between groups of 
rats receiving continuous reinforcement in 
narrow or wide tube alleys and goal boxes 
(Robbins, 1966). Paul (1969), however, 
failed to find an effect due to goal box size; 
it should be noted that she used a shuttle 
alley and also failed to find the usual PREE. 
Goal box cues and activity. Somewhat re- 
lated to the goal-box-size studies is the 
Freides (1957) study, where the color of the 
goal box was manipulated on reinforced and 
nonreinforced trials during acquisition and 
extinction. The PREE was obtained for all 
partial groups although it was larger for 
groups run to (a) a different goal box in 
extinction than acquisition, (b) the negative 
goal box (from acquisition), (c) the same 
goal box as in acquisition than for a group run 
to the positive goal box from acquisition. 
Pilley and Leeming (1966) tilted their ap- 
paratus after the subject entered the goal box 
during extinction and reported obtaining the 
PREE with a trial-to-criterion measure, but 
d measure. Wilson (1964) 
partially reinforced subjects 
oor of the goal box on some trials 


and on a shelf in the goal box on others, 
While two other Sroups were fed on either 
the floor 


or the shelf on their reinfor 
No differences wer 


extinction, Similarl 
Hug (1969) 
approach moy, 


ced trials, 
e found in resistance to 


y, Amsel, Surridge, and 
found that althou 


Speed) w. 
Measures are taken, 
partial subjects are g 
continuous subjects 


up differences in 
reinforcement schedule, i 

Casurement of goal box activity, A 
number of Studies, often Prompted by inter- 


ference notions of the PREE, have attempted 
to measure interfering ü 


and competing re- 
Sponses made by the Subject in the goal box 
and, at times, ; 
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š d 
versals and halts than continuously renio 
ones (Jones & Bridges, 1966, 1967; he cot- 
& Marx, 1965). Also, a high negativ da 
relation apparently exists between spee = 
number of halts and reversals (Jo "ore 
Bridges, 1966, 1967). Partially m anui 
subjects also retrace less than 2 alleyway 
subjects do during extinction in € Mars) 
(Bradley & Wong, 1969; McCoy ontrast, 
1965; Mikulka & Pavlik, 1966). In contre? 
Harris et al. (1962) failed to find r Y akend 
ferences in the variability of the pat hes F 
down the alley between partially and vis were c 
ously reinforced subjects. When subje of 
allowed to retrace the entire eed foun 
alley during extinction, Allison (196 a 
that partially “reinforced subjects ibject$ | 
slower than continuously reinforced = same | 
who, during acquisition, received a group 
total number of trials, but faster than ss buti 
that received half the number of a trials: | 
the same total number of reinforce differ | 
Longstreth (1964) failed to find any ubjerts i 
ences between partial and continuous sj < 
in “backing up” behavior in the gmd that 
Similarly, Robbins et al. (1968) ger. 
partially reinforced subjects took e 
to retrace a section of the goal bo: 
Continuously reinforced subjects. eve 
though partial subjects make fewer eti g 
halts, and, in general, fewer com} subjects 
interfering responses than continuous appe 
in both the alley and goal box, Sn the yj 
to execute these responses (or at ; 
initial one) faster, 


, Baco” 
Manipulations in the start ey ndr P 
(1965) found that subjects blocke r trial? f 


(hei 
Past the start box door on half of ^ conv’ 
Were as resistant to extinction * 1964) be 
tional partial group. Longstreth d that su 
McCain and Power (1966) Ra: 
jects given 50% reinforcement jsplaye S 
box (and 100% in the goal box) di5P ^7 pma 


a 4 

PREE. These studies demonstrate be 5 
nipulations in the start m. o ye d 
generally performed in the goa d noto. 
the conventional PREE. It shoul rt boxe 4 
that as in most studies, the ae colo 
alleys, and goal boxes were the is ve ie 

Drug studies. Studies using drug The E 
vealed extremely interesting effects: ^^ nd 


Se È rats 
eral design is to train groups of 
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either partial or continuous schedules, each 
Subject being injected with either saline or a 
drug, prior to each day's run. Groups are then 
Subdivided during extinction with regard to 
the drug or saline. Generally, a change in the 
drug state leads to a smaller PREE, although 
the effect is still found using amobarital 
(Gray, 1969; Ison & Pennes, 1969) and 
chlordiazepoxide (Iwahara, Nagamura, & 
lwasaki, 1967). Only Gray (1969) reported 
an elimination of the extinction effect, and 
then only when subjects were switched from 
the drug in acquisition to the saline condition 
in extinction. It should be noted that these 
drugs generally reduce the subject's overall 
: Speed. 

Within-subject effect. Amsel and his col- 

leagues, as well as others (see Amsel, 1967; 
Amsel, Rashotte, & MacKinnon, 1966, for 
relevant references), have recently examined 
a within-subject PREE. Specifically, subjects 
received partial reinforcement in a distinctive 
alley (e.g, black) and continuous reinforce- 
ment in another distinctive alley (e.g., white). 
These studies generally reveal a failure to 
find the conventional PREE, perhaps because 
of generalization of extinction from the 100% 
reinforced stimulus to the partially reinforced 
Stimulus. In addition, Rudy, Homzie, Cox, 
Graeber, and Carter (1970) failed to find 
Breater resistance to extinction in the alley 
Where nonreinforced trials were followed by 
reinforced trials in contrast to the continu- 
ously reinforced alley. 
. “Patterning.” Capaldi and his colleagues 
(Capaldi, 1966a, 1967) have introduced an- 
Other major set of variables that emphasize 
Sequential effects of reinforcement schedules. 

TOups of subjects given a single alternating 

0% reinforcement schedule have been found 
9 be less resistant to extinction than a double 
alternation or continuous group (Bloom & 
Capaldi, 1961; Capaldi & Minkoff, 1967). 
n Contrast, Surridge and Amsel (1966) found 
no difference in resistance to extinction be- 
Ween single alternating partial or random 
Partial groups, both being more resistant 
than a continuous group. The random partial 
Stoup showed slightly more resistance (al- 
though statistically not significant) than the 


alternating group; this difference was attrib- _ 


uted to terminal acquisition differences. Per- 
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haps the use of Anderson's (1963) shape- 
function measure, which adjusts for different 
terminal acquisition levels, would help resolve 
the discrepancy. Capaldi and Hart (1962) 
pointed out that a single alternating reinforce- 
ment schedule leads to more resistance, no 
difference, or less resistance to extinction than 
a random partial group, when subjects receive 
a small, moderate, or large number of train- 
ing trials, respectively. 

Other sequential effects. Capaldi (19662, 
1967) and his colleagues have also investi- 
gated the effect, upon resistance to extinction, 
of the number of consecutive nonreinforced 
trials preceding a reinforced trial during ac- 
quisition (N length), the number of different 
N lengths, and the number of each N length, 
while holding the percentage of reinforcement 
constant. Generally, these studies reveal that 
independent of the percentage of partial re- 
inforcement, resistance to extinction is posi- 
tively related to these variables (see Capaldi, 
1966a, 1967, for relevant references; Spivey, 
1967; Spivey & Hess, 1968; Spivey, Hess, 
& Black, 1968, 1969). These effects are 
sometimes interpreted in terms of the number 
of nonreinforced-to-reinforced trial transitions. 
Despite the fact that number of training trials 
may interact with these variables, these 
studies have demonstrated the importance of 
sequential effects on the PREE (Capaldi, 


1966a, 1967, 1970, in press). Recently, 
Capaldi and Kassover (1970) found no 
difference in resistance to extinction be- 


tween groups receiving different numbers of 
nonreinforced trials during acquisition with 
N length and number of nonreinforced-to- 
reinforced trial transitions held constant. How- 
ever, N length was found to result in greater 
resistance to extinction despite a constant 
number of nonreinforced-to-reinforced trial 
transitions. 

Intertrial reinforcements. Another technique 
introduced by Capaldi and his colleagues to 
investigate aspects of sequential theories 
(Capaldi, 1970) has been to give the 
subject a reinforced goal box placement 
after a nonreinforced in contrast to after a 
reinforced trial. The intertrial reinforcement 
effect can be interpreted as affecting non- 
reinforced-to-reinforced trial transitions and 
N length. Generally, intertrial reinforcements 


420 


after a nonreinforced run in the alley result 
in less resistance to extinction than a group 
receiving its intertrial reinforcement aíter a 
reinforced run in the alley (see Capaldi, 
1967, for relevant references; Capaldi & 
Wilson, 1968; McCain, 1966, Experiment IV 
& V; Spivey, Hess, & Aponte, 1968; Spivey, 
Hess, & Klemic, 1968). Lobb and Runcie 
(1967) found that intertrial reinforcements 
were effective in reducing resistance to extinc- 
tion if given 15 seconds but not 1 hour after 
the alley trial. However, Overstreet and 
Campbell (1969) found that they were not 
effective if given 15 seconds but were if given 
30 seconds aíter the alley trial. 

Recently, evidence has been found that the 
intertrial reinforcement effect. disappears after 
a relatively large number of acquisition trials 
(Black & Spence, 1965; Spence, Platt, & 
Matsumoto, 1965). However, Pratt and 
Halpern (1969) gave 72 acquisition trials 
under partial reinforcement and found the 
effect when they equated experiences with 
intertrial reinforcement during acquisition or 
extinction. Since Spence et al. (1965) and 
Black and Spence (1965) gave placements 
after every nonreinforced trial, it is possible 
that subjects eventually discriminated them 


from regular alley runs, and thus the effect 
disappeared. 


Similarly, 


ilton ( 1967) 
reinforced goal 
ont of the food cup. In 
Rudy, and Carter (1970) 
| mall number of acquisition 
trials, intertrial reinforcements eliminated the 
PREE, but with extended training, a PREE 
was obtained, that is, the effect of intertrial 


reinforcements was eliminated. They also re- 
that intertrial nonreinforcements led 
to a PREE with a small number of trials 


addition, Homzie, 
found that with a s 
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subjects given partially reinforced runs a 
alley were more resistant to cnn, hc 
a group receiving continuously reinforce B 
runs and partially reinforced placem E 
These studies suggest that nonreinto qM 
"runs" in the alley and even in the goal 4 
may not be essential for the appearance ies 
the PREE. Perhaps, as suggested by V ich 
(1967), exposure to the goal area is esse! d 
and this may constitute “approach respon 

to the goal. 


ACQUISITION EFFECT att 
Although almost all investigators Pai 
finding the PREE, many have failed MS ade 
the PRAE. The difference in the ae PR 
of the effects is quite large, the PRAP ion 
considerably smaller. The boundary ote rela- 
for the acquisition effect appear to i 
tively widely spaced trials and a Eae are 
large number of acquisition trials. Th «tially, 
actually three acquisition “effects. aed 
partial subjects run slower than conti aster- 
subjects, although ultimately they run -ove 
This is often referred to as the Sro 9593 
effect” (Goodrich, 1959; Haggard, A 
Ross, 1964; Spear et al, 1965; W Medi 
1961; Weinstock, 1958), Asymptotically "^. 
partial subjects run faster than cona sec- 
subjects. The asymptotic difference, t al 
ond effect, is the one that is primarily RAL 
with here, so that reference to the artial, 
will mean faster asymptotic running by P* Some 
in contrast to continuous, subjects. eeds, 
investigators have recorded goal box T and 
in addition to start box and alley spee anin 
have reported faster asymptotic Er no 
Speeds by continuous subjects, that 15, 
PRAE (Goodrich, 1959: Haggard, 
McCoy & Marx, 1965; Wagner, 1961). 
effect is dealt with separately and is ! 


ous 


asic refer 
to as the “goal box” effect. The eit Weir’ 
ence for the acquisition effect is agait latio? 


Stock (1958), who found an inverse ais an 
etween asymptotic start and alley spee ntagt 
the trial block of the crossover, and perce 
of reinforced trials. per 9 
Amount of reinforcement. A oeste ic 
investigators have reported faster asymp E 
Speeds for partial subjects receiving bid. 
Contrast to small, amounts of xv os mz 
(Brooks & Dufort, 1967; Hill & Wa 
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1967; Hulse, 1958; Mikulka, Lehr, & Pavlik, 
1967; Padilla, 1967; Wagner, 1961), al- 
though recently both Amsel, Surridge, and 


. Hug (1969) and Roberts (1969) failed to 


find this using varying numbers of food 
pellets, Of the above studies, only Amsel, 
Surridge, and Hug (1969) and Mikulka et 
al. (1967) found the PRAE. These data sug- 
gest that amount of reinforcement has similar 
effects on continuous and partial subjects, 
Since increasing amount or duration of rein- 
forcement generally leads to faster asymptotic 
running speed under continuous reinforcement 
(Robbins, 1969). However, McCain (19702) 
Tecently reported that a partial group given 
à large, in contrast to a small, amount of rein- 
forcement ran faster after 42 acquisition 
trials, although this difference disappeared 
after 138 trials. The disappearance or reduc- 
tion of the effect of amount of reinforcement 
on both continuously and partially reinforced 
Subjects, as pointed out by McCain (1970a), 
tequires further investigation, and a definitive 
Statement must await such research. 
Deprivation level. Mikulka and Pavlik 
(1966) reported finding the crossover effect 


for only an ad lib group with no differences . 


€tween partial and continuous groups under 
Various deprivation levels. On the other hand, 
Badia (1965) found the crossover effect under 
23-, 113-, and 223-hour food deprivation, and, 
Although partially reinforced subjects ran 
faster asymptotically under medium and high 
deprivation conditions, these differences were 
Not statistically significant. It appears that 
Increasing deprivation level results in faster 
asymptotic running for both partial and 
Continuous subjects, but Badia's (1965) data 
Suggest that the effect on partial subjects 
‘evels off faster than that on continuous sub- 
Jects, Finally, Ross (1964) found that when 
Broups of subjects, after demonstrating à 
PRAE under food deprivation, were switched 
to Water deprivation, the asymptotic difference 
Isappeared. 

Effects of shifts in reinforcement schedules. 
TOups of rats after partial reinforcement 
taining often show an increase in running 
Speed when switched to à 10056 reinforcement 
edule (Harris et al, 1962; Hill .& 
Wallace, 1967; Mikulka et al., 1967; Robbins 
et al, 1968; Sutherland et al, 1965), al 
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though others have failed to obtain this effect 
(Badia, 1965; Rashotte & Surridge, 1969; 
Surridge, Boehnert, & Amsel, 1966; Theios, 
1962). Groups of rats after 100% training, 
when shifted to a partial reward schedule, 
usually show a decrease in speed (Mikulka 
et al., 1967; Sutherland et al., 1965). Badia 
(1965) found that subjects switched from 
partial to continuous or continuous to partial 
converged to one another’s asymptotic level. 
Leung and Jensen (1968) gave groups of rats 
different percentages of reinforcement fol- 
lowed by continuous reinforcement and found 
that they all ran faster than subjects con- 
tinuously reinforced throughout. Similarly, 
subjects switched from continuous to partial 
schedules generally showed no effect or a 
decrease in speed. Thus, subjects switched 
from continuous to partial generally decline 
in speed, and the reverse leads to no change 
or an increase in speed. 

Reacquisition. Successive reacquisition and 
reextinction may be viewed as similar in form 
to the partial reinforcement situation (Jensen 
& Cotton, 1960). Generally, reacquisition is 
faster than original acquisition (Cotton et al, 
1967; Harris et al., 1962; Huang & Huang, 
1970; Jensen & Cotton, 1960; North & 
Morton, 1962), and sometimes higher asymp- 
totes are obtained (Cotton et al, 1967; 
Harris et al., 1962). However, Surridge et al. 
(1966) failed to find higher reacquisition 
asymptotes after 80 extinction trials, although 
Robbins et al. (1968) did. The reasons for 
these discrepancies are not clear. 

Effect of initial nonreinforced trials. A 
number of investigators have found that 
groups of rats given nonreinforced trials prior 
to 100% reinforced acquisition run faster at 
asymptote than subjects receiving 10096 re- 
inforcement throughout (Cotton et al., 1967; 
Logan, 1960, p. 188; Robbins et al, 1968; 
Spear & Spitzner, 1967), while others have 
failed to find this effect (Spear et al., 1965). 
Tt appears that relatively widely spaced trials 
are requisite, or at least helpful, in obtaining 
this particular effect. 

Nonreinforced goal box confinement dura- 
tion. Hulse (1958) found no effect on acqui- 
sition asymptotes due to nonreinforced goal 
box confinement time. It should be noted that 
he gave only 25 acquisition trials, and this 
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may account for the failure to obtain the 
PRAE, so that a definitive statement cannot 
be made. 

Goal box size. Both Bradley and Wong 
(1969) and Kirkpatrick et al. (1964) failed 
to find any difference between partial and 
continuous subjects as a function of goal box 
Size. Goodrich (1959) found the PRAE in 
a small and large goal box, and inspection 
of the curves suggests a bigger effect due to 
the large goal box. In addition, the large- 
goal-box study yielded no differences in goal 
Speed between partials and continuous sub- 
jects, while the small goal box yielded a 
significant “goal box effect,” 

Goal box cues and activity. Wilson (1964) 
found little difference between à partial group 
fed either on the floor or On a shelf in the 
goal box on different trials and a partial group 
fed in only one location through acquisition, 
with a slightly greater speed displayed by the 
latter group. Tacker and Way (1968) re- 
placed the goal box With a large open field 
and found that partial subjects were more 
active after nonreinforced in contrast to rein- 
forced trials, although the differences dimin- 
ished during training, but not to zero. Gallup 
and Hare (1969) reported that partial sub- 
jects were found to be more active (in an 
th reinforced and non- 
continuously reinforced 
found that partiais 
food cup than con. 
forced trials, "These 
reveal that partial sub- 
ve in the goa] box than 


continuous subjects 

Measurement o 
and Marx (1965) 
made more reversals and halts in t 
early in training 
although at asymptote both grou 
making almost no competing responses, A 
similar result has been reported by Jones and 
Bridges (1966), although the crossover effect 
was not statistically significant in 
using satiated rats (Jones 
Goodrich (1959) found that partials stopped 
more in the alley early in 
in training, converge 
continuous subjects, 
Mikulka and Pavlik (1 


he alley 
subjects, 
ps were 
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made slightly more pauses and ag m" 
the runway than continuous subjects u at 
various deprivation levels, except for a i E 
ated condition, where the reverse was w E 
Harris et al. (1962) reported no chene i 
the variability of the path taken by je ui 
subjects on nonreinforced trials during "e 
sition. McCoy and Marx (1965) pe box 
sured pauses and reversals in the goa von 
and found that partials made more than Ec 
tinuous subjects throughout. acquisition. irat 
tial subjects have also been found to a 
faster than continuous subjects on rein "e. 
trials (Allison, 1967), and there has M 
been a failure to find an increase in pus 
time during nonreinforced ww In 
(Allison, 1967; Robbins et al, 19 ^ ine 
addition, Longstreth (1964) adt os 
crease in competing behavior on a ae it 
trials during acquisition. In summer es 
appears that partial subjects parton Are 
competing activity in the runway, ay to 
goal box, and show little change in ti se in 
perform the initial interfering eet, 
the goal box during the course of ae 
It should be noted that of the above = al. ff 
only McCoy and Marx (1965), Harris nd 4 
(1962), and Robbins et al. (1968) E 
PRAE, and that McCoy and Marx Era 
also reported finding the “goal box” efi "Bacon 
Manipulation in the start box. totic 
(1965) found no differences in mis 
acquisition speeds between subjects con- 
Írom the alley on half of their trials, A ain? 
ventional partial group, and a 100% half 
forced group. When subjects are fed 50% o 
of their trials in the start box (and 10 A 
the time in the goal box), they are on st 
Tun faster after nonreinforced, in eie | 
reinforced (start box), trials a. h 
1964; McCain & Power, 1966). oot nt 10 
Tunning after nonreinforcement is simi o 
an effect (the “frustration” effect). Us 
found by Amsel and others in the orted 
alley” (Amsel, 1958, 1962). They reP nov" 
faster Tunning in the second alley after 


he frst 
reinforcement in the goal box of t e 
alley, 


. i for 
contrast to running after rein 
ment in the first goal box. «« elim 
Drug studies. Generally, the PRAES E 
nated with amobarbital (Gray, 1969; Iwasa” 
1963), chlordiazepoxide (Iwahara, Iv 
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Nagamura, & Masiryama, 1966), and ethyl 
alcohol (Nelson & Wollen, 1965), with con- 
trols (both partial and continuous subjects) 
running faster than experimental subjects. 

Within-subject effect. Using the within- 
subject procedure previously described, Amsel 
and his colleagues and others (see Amsel, 
1967; Amsel et al, 1966, for relevant refer- 
ences) have generally found the conventional 
PRAE in the start box and alley, and the 
Teversal in the goal box. 

“Patterning.” Capaldi and his colleagues 
Save subjects a single alternating 50% rein- 
forcement schedule and found a “patterning 
effect,” that is, faster running following non- 
reinforced trials in contrast to trials following 
reinforcement, that is, faster running on rein- 
forced in contrast to nonreinforced trials 
(Bloom, 1967; Bloom & Capaldi, 1961; 
Bowen & Strickert, 1966; see Capaldi, 1967, 
lor other relevant references; Ison & 
Pinckney, 1968; McCain, 1966; Experiment 
IV). The faster running after nonreinforce- 
Ment is reminiscent of the “frustration” effect 
referred to earlier, in the double alley (Amsel, 
1958, 1962). With a double-alternation sched- 
Ule, no patterning effect was found by Bloom 
and Capaldi (1961), although Yelen (1967) 
did find a patterning effect with both large 
ànd small rewards. Generally, when the dis- 
Crimination develops, subjects slow down on 
nonreinforced, in contrast to reinforced, trials. 
However, conflicts with regard to the pattern- 
ing effect exist in the literature. Amsel and 
his colleagues have suggested that the experi- 
menter may inadvertently provide the subject 
With cues with regard to trial type (e.g. 
Cleaning out the goal box only after reinforced 
trials or differential odor cues) and have 
Provided evidence in this regard (Surridge & 

Msel, 1965a). Recently, some studies in 
Which trials were spaced at one per day have 
ailed to detect a patterning effect (Surridge 

Amsel, 1965b, 1966, 1968). However, 
Others have found patterning with one trial 
Der day (Capaldi & Lynch, 1966; Capaldi & 
Spivey, 1964a, 1964b). Capaldi and Spivey 

965) stated that the discrepancy may be 

Slated to nonreinforced trial goal box con- 
ement time, They cited the data of Burt 
and Wike (1963), which showed that non- 

SInforced trial speeds are slower and pat- 
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terning appears earlier the longer the non- 
reinforced trial confinement. In addition, they 
provided evidence that at one trial per day, 
patterning results from 120-second confine- 
ment times but not from 30-second confine- 
ment time (Capaldi & Lynch, 1966). Bloom 
(1967), using highly massed trials, found that 
large reinforcements and large nonreinforced 
confinements produced the largest patterning 
effect. However, with the one trial per day 
procedure, Surridge and Amsel (1968) failed 
to find patterning using three groups that 
had reinforced-nonreinforced durations of 30 
seconds-30 seconds, 30 seconds-120 seconds, 
and 120 seconds-30 seconds, respectively. 
They suggested that perhaps reward magni- 
tude is an important variable in obtaining 
the effect, although pertinent data are not 
available. Amsel et al. (1968) did not dispute 
the findings with highly massed trials, and, 
in fact, data from their laboratory revealed 
patterning with a 20-second intertrial interval 
but not with a 12-minute interval (Surridge, 
1967). They have failed to obtain patterning 
at one trial per day and have pointed out 
many of the possible sources of differential 
cues the experimenter may inadvertently pro- 
vide as a basis for the effect (Surridge & 
Amsel, 1965b, 1968). Bloom and Malone 
(1968), with a 1-hour interval, and Harris 
and Thomas (1966), with highly massed 
trials, reported finding the patterning effect 
when care was taken to eliminate possible 
sources of differential cues. The discrepancies 
are not yet resolved, but some summary 
statements may be advanced. Under highly 
massed trials a patterning effect appears. 
However, with spaced trials the results are 
unclear since relatively long nonreinforced 
and reinforced goal box confinement times in 
some cases lead to patterning (Bloom & 
Malone, 1968; Capaldi & Lynch, 1966; 
Capaldi & Spivey, 1964a, 1964b), while in 
others patterningisnot found (Surridge, 1967; 
Surridge & Amsel, 1965a, 1966, 1968). Obvi- 
ously, additional research is required and, as 
pointed out by Surridge and Amsel (1968): 
“The reason for this discrepancy, apparent or 
real, is still not entirely obvious [p. 108]." 

Recently, studies have been reported (e.g., 
Ludvigson & Sytsma, 1967; McHose, 1967, 
McHose, Jacoby, & Meyer, 1967) that sug- 
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gest that rats may exude differential odors 
when nonreinforced, in contrast to those 
exuded when reinforced, and that rats can 
use odor trails (of the previous subject) as a 
differential cue if subjects are run in squads. 
Based on these studies, Amsel, Hug, and Sur- 
ridge (1969) found that at one trial per 
day, they were able to obtain a patterning 
effect, eliminate it, and reinstate it, depend- 
ing upon whether the reinforcement or non- 
reinforcement of the first rat run each day 
was determined or random. They interpreted 
their results as evidence that odors may be 
used as differential cues when they are 
Systematically correlated with reinforcement 
and/or nonreinforcement, However, their data 
are in conflict with those of Capaldi and 
Lynch (1966), who took care to control dif- 
ferential odor cues and found à patterning 
effect. 

Other sequential effects and intertrial rein- 
forcements. The studies investigating the ef- 
fects of nonreinforced-to-reinforced trial transi- 
tions and N length on resistance to extinction, 
either through schedule manipulation or inter- 
trial reinforcements, have generally failed to 
acquisition differences, 
extinction have been 
ail to detect terminal 


food cup area (Badia, 1965; Goo 
Haggard, 1959 P 


reported no differences in speed 
and nonreinforced trials, 
perimental control when available, rather 


than a statistica] comparison, is 
more desirable. When care ; 


on reinforced 
it seems that an ex- 
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‘vation 
1969) as well as under standard deprivatio! 
conditions (Robbins et al., 1968). 


Some THEORETICAL POSITIONS 


Lewis (1960) summarized the major ws 
retical positions popular at that time. n E 
review, we evaluate three intertrial or d 
Sequential theories (Capaldi, 1970) 061 
& sequential theory (Capaldi, 1966a, I 
1970, in press). Since the extinction -- . 
most robust, theoretical analyses of this e Je 
are evaluated initially, and if the pele, be 
Still viable, then the acquisition effect wi 
dealt with, Fes 

Cognitive dissonance. Lawrence and ry 
tinger (1962) have applied dissonance E 
to the behavior of the rat in the pd 
Nonreinforcement is assumed to produce de 
Sonance because the organism has epen 
effort to reach the goal box (and presu d: rce- 
“expects” reward) and receives no rein ale, 
ment. Since, in the usual partial sche are 
reinforcements and nonreinforcements ag 
intermixed in some random order, the su ur 
is assumed to develop some extra pre 
either for the activity or for the goa sects 
itself. Thus, in extinction, partial suh 
have this extra “attraction,” while conh og 
ous subjects do not, and therefore the -— 
is expected. The patterning effect on dies 
tion, and the other sequential effect stuc y 
reveal that it is the trial sequences, a trials 
simply total number of nonreinforced tinge? 
as suggested by Lawrence and Fes e i 
(1962), that are related to resistan ting 
extinction, In addition, studies demons initi? 
increased resistance to extinction after T pjs 
nonreinforced trials may be damaging t pive- 
view since opportunities for “extra pP 
ness" to be built up are not available then 
it is assumed that nonreinforced trial nal) 
Selves lead to “extra attractiveness. jisitio” 
studies using a small number of acq" aging 
trials, which show a PREE, are dam sde 
for similar reasons. Thus, there is p £ 
able evidence that provides difficulty 
dissonance view of the PREE. el and 

“Frustration”? According to the Ams 1962: 
Spence formulation (Amsel, 1958; ts d 
Spence, 1956, 1960), nonreinforcement? ply 
to the development of Kile we ial 
When they are preceded by reinforce 
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"Thus, frustration develops due to the subject 
not being reinforced after an expectancy of 
reward has been built up. Spence (1956, 
1960) has further argued that partial sub- 
Jects should run faster in the alley because 
they are running under a “frustrative” drive, 
but that continuous subjects should run faster 
in the goal box because they have developed 
higher levels of “incentive motivation." 
Studies using a small number of acquisition 
trials, revealing a PREE, are particularly 
troublesome for frustration theory since not 
Many trials are given, and it would appear 
that there is not enough opportunity for 
"frustration" to be built up. Some theorists 
have suggested that there are two partial ef- 
fects, a small trials effect, and the more 
usual one (Amsel et al., 1968). The latter is 
presumably based on frustration concepts. In 
fact, Amsel et al. (1968) modified frustration 
theory so that one reinforced trial was enough 
to provide frustration by resorting to a 
buildup of reward expectancy through the 
Use of multiple-pellet in contrast to single- 
pellet rewards. However, studies giving initial 
Nonreinforced trials, revealing a “partial 
€xtinction-like” effect have shown, as stated 
by Capaldi, Ziff, and Godbout (1970), that 
“Tf reward is expected here, the source cannot 
be a prior rewarded trial [p. 62].” Brooks 
(1969) recently suggested that the small trials 
PREE may result from differences in the 
amount of “primary unconditioned frustra- 
tion” between continuous and partial groups. 
This presumably results from continuous 
Subjects receiving more reinforced trials than 
Partials, since, in these studies, the total 
number of trials is usually equated. A similar 
interpretation has also been suggested by 
Amsel, Surridge, and Hug (1969). However, 
Capaldi and Waters (1970) found that even 
When the number of reinforced trials was 
equated (five), groups receiving five non- 
reinforced trials either prior to their rein- 
forced trials or following two reinforced trials 
Were more resistant to extinction than the 
Continuously reinforced (five-trial) group. 
hus, without additional explanatory con- 
“epts, evidence against a frustrative view of 
he extinction effect exists. Attention should 
also be brought to the “goal box” effect or 
Goodrich-Wagner” effect, since the reversal 
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of the PRAE in the goal box during acqui- 
sition is often cited as support for the frustra- 
tion position. However, as shown by Robbins 
et al. (1968) and Dachowski and Dunlap 
(1969), when discriminative stimulation due 
to presence or absence of the reward is 
eliminated, the reversal of the PRAE in the 
goal box disappears. These results, in addi- 
tion to the initial nonreinforced trial studies 
and the small number of acquisition trial 
studies, provide a body of data that con- 
stitutes considerable difficulty for frustration 
theory. 

“Competing responses.” The most current 
version of competing response theory is that 
of Weinstock (Robbins et al, 1968; Wein- 
stock, 1954, 1958). According to this view, 
nonreinforced trials provide an opportunity 
for nonfunctional components of the running 
response to drop out, that is, to habituate and 
thereby enable partially reinforced animals 
to achieve a higher acquisition asymptote 
than continuously reinforced subjects and also 
display more resistance to extinction. Despite 
the fact that a number of predicted effects 
on running speed have been confirmed (e.g., 
reacquisition, effect of initial nonreinforced 
trials), there has been a general failure to 
detect the elimination or habituation of com- 
peting responses (e.g., Harris et al., 1962; 
McCoy & Marx, 1965; Robbins et al., 1968). 
In addition, studies using small numbers of 
acquisition trials are troublesome unless one 
is willing to assume that habituation occurs 
at a very high rate. Thus, although speed 
effects are found, problems arise, since the 
basic mechanism has not received independent 
empirical support. As a result, when com- 
peting response theory is specified, as sug- 
gested by Marx (1963), it leads to failure 
x6 validate empirically the major construct. 

Sequential theory. Capaldi (1966a, 1967, 
1970, in press) has recently revived the 
old. aftereffects theory by invoking a mem- 
ory mechanism. Specifically, it is assumed 
that stimuli arising from nonreinforcement 
are maintained through a memory mechanism, 
and when the subject is eventually reinforced, 
these stimuli become conditioned to the 
running response. The patterning studies, 
intertrial reinforcement studies, and the other 
sequential effects studies have all been taken 
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as providing strong support for this view. 
Additional support for the sequential theory 
comes from a series of studies investigating 
Sequence effects on resistance to extinction 
after training with varying amounts of partial 
reward. Generally, large amounts of rein- 
forcement lead to greater resistance to extinc- 
tion under a partial schedule than small 
amounts. However, it has been shown that 
the critical variable is the number of non- 
reinforcements preceding a large amount oí 
reinforcement; this condition leads to greater 
resistance to extinction than groups given the 
same number of small and large rewards and 
nonreinforcements, except in a different order 
(Capaldi, 1970; Capaldi & Lynch, 1968; 
Capaldi & Minkoff, 1969; Leonard, 1969), 
Some have suggested that with massed trials, 
Sequential variables are crucial, but with 
relatively spaced trials either a frustrative 
view (Amsel, 1967) or a contrast view 
(Gonzalez & Bitterman, 1969) is appropriate. 
However, some of the partial reward- 
sequential studies referred to above were at 
one trial per day, Thus, the sequential view 
appears to be most viable with regard to the 
extinction effect. As pointed out by Capaldi 


learned in acquisition than « 
tive acquisition measures," H, 
Subjects learn different things during acqui- 
sition depending Upon sequences, and behavior 


during extinction is the best indicator of what 
was learned in acquisition, 


To summarize u 


€ points out that 


tion, and competing 
because of studi 


does not appea: 


the development of either “ 
Bess?" € 


ther, 


extra attractive- 
r “habituation,” Fur- 
theory Proposed by 
ure the major aspects 
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from reinforcement, had reached asymp P 
It would appear, however, that if both o 
cesses had reached their respective A 
totes, subjects should run faster after a ein- 
reinforced trial, in contrast to after a T! g” 
forced trial. Aside from results of how. 
studies, there appears to be no evident MM 
such a difference (e.g, Goodrich, t all 
Weinstock, 1958). Thus, it appears Ee 
three nonsequential theories evaluated, wi ae 
additional concepts or modifications, as aint 
as Capaldi’s sequential theory, cannot acc the 
for the entire range of data. It may er. 
Case, as pointed out by Weinstock (P adi 
that a position incorporating elements © i 
may be most viable. However, this = a 
undoubtedly be somewhat eumberone cc 
result, for the theorist desiring pars idence 
along with explanatory validity, the Him the 
reviewed here appears to suggest tha oach 
sequential approach may be a viable app” ] as 
and worth considerable, theoretical, as We 
experimental, exploration. +a] theory 
The basic mechanisms of sequential ork: 
may be cast in the following framer o 
Nonreinforcement has specific stimulus Ea 
quences that persist through some sac es 
memory or storage mechanism. rms o 
nonreinforcements lead to a modification 
the stored stimulus consequences, prese 
to some asymptotic strength. Reinfo pr 
leads to the conditioning of stimuli p the 
on that trial to the running responses uy 
Stimulus source is not only external ory 
cues but also the representation (in pne b 
of stimuli arising from nonreinforce ctio” 
Thus, nonreinforcement during m 
leads to stimuli that have been condi p 
to running during acquisition for those ; rein 
given nonreinforced trials followed BY orit 
forcements, In addition, longer nonrein Jl 
trial runs followed by reinforcement, aS qto- 
as increasing numbers of nent to 
reinforced trial transitions, should. T 
Sreater stimulus modification. (arising habit 
nonreinforcement) and thus a apes i 
Strength with regard to the running ve ef- 
Some concluding remarks regarding si udY 
fects of the variables reviewed in this is" 
are in order. Tt is both interesting = nt f 
turbing that some variables have die d 
fects depending upon whether the subj 
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partially or continuously reinforced. Since 
most studies have used a single partial sched- 
ule (50% reinforcement), the generality of 
the former interaction is not known. It would 
Seem unlikely that basically different mecha- 
nisms operate depending on reinforcement 
Schedule. For example, how different is a 90% 
reinforcement schedule from a 10045 sched- 
ule? Thus, a further task is to unravel this 
apparently dichotomous situation. Further 
Investigations and extensions of sequential 
theory may shed light on this problem. 
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This study reviews the literature on in- 
strumentality theories in psychology. These 
theories are distinguished by the hypothesis 
that the behavior of an individual is in part 
determined by (a) his expectations that the 
behavior will lead to various outcomes and 
(b) his evaluation of these outcomes, The 
Success of these theories in predicting behavior 
is evaluated for three distinct areas of research: 


verbal conditioning, attitudes, and industrial 
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"Third, a network is said to exist when at least 
one construct, which is linked to multiple re- 
ductions, is linked to another construct, which 
is also linked to multiple reductions. Each of 
these links may be formulated in a proposition, 
and it is this set of propositions that is required 
for the supportable use of unobservables. 
Fourth, the adequacy of a theory is evaluated 
in terms of its competitive support. A theory 
is competitively supported to the extent that 
data follow naturally from it, and competing 
theories require augmenting assumptions or 
modifications to explain the same data. 
Hypotheses are also evaluated on the basis of 
their competitive support. 

Dulany's (1968) logic of theoretical networks 
may be summarized in five principles: 


1. Theoretical terms are introduced postula- 
tionally by a network of theoretical sentences. 
Each sentence hypothesizes a link between the 
theoretical terms and another theoretical term 
or an operation. The hypotheses may originate 
in common sense, intuition, experimental 
findings, or any other source the investigator 
deems useful. Earlier logics treated links be- 
tween theoretical terms and operations as 
Matters of definition. Following Cronbach and 
Mech! (1956), the present logic recognizes 
that we may want to reject an operation as an 
instance of the construct in question, without 
Changing our idea of the nature of the con- 
Struct. Hence, links between theoretical terms 
and operations are hypotheses to be evaluated 
9n the basis of empirical results. 

2. The network is always open to the addition 
Of new terms and to the specification of new 
linkages between terms that are already in the 
network. Thus, the network is never complete. 

3. Terms get their meaning through connec- 
tions to other variables in the network, which 
brovide the competitive support of the sentences 
™ which they appear. Thus, the meaning of a 
term comes from the variables to which it is 
linked and the supported hypotheses about 
€ach of those variables. 

4. The Specification of the meaning of any 
term is always partial. There are at least two 
reasons for saying this. First, the linkages of 
the term to other terms in the network are 

Ypothetical and, thus, are true only with some 
Probability, Second, because the network is 
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open, new terms may always be linked to the 
term, thereby altering its meaning. This 
principle implies that we may only converge 
on the referents and referential meaning of a 
term. This is in contrast to the behaviorist 
position, which claims to define terms entirely 
by the operations to which they are linked 
(Maltzman, 1966). 

5. Confidence in the referential meaning of a 
term is as strong as the competitive support of the 
network or theory in which it appears. This 
principle demands that theoretical terms be 
placed in “richly articulated" networks in 
order for them to be adequately supported. 
The greater the number of terms that the term 
of interest is linked to, and the greater our 
confidence in this set of linkages, the greater 
our confidence in the meaning of the term. 


What does this logic imply about the use of 
terms that refer to unobservables? First, terms 
referring to unobservables may be placed in 
the network in the same way as any other 
theoretical terms. Hypotheses concerning the 
causal status of terms such as expectation or 
intentions are linkages between the term and 
other terms. Support for the hypothesis is 
based on the competitive support of the net- 
work in which it appears. 

Second, this approach does xot suggest that 
constructs referring to unobservables need not 
be grounded in some form of observable meas- 
urement. All terms (both observable and un- 
observable) are linked, though hypothetically, 
to operations. Operations are therefore seen as 
tentative reflections of these constructs. Sup- 
port for the hypothesis that an operation is a 
measure of a certain construct is provided to 
the extent that the measure is related to meas- 
ures of other constructs in the network in the 
way that our theory says it should and should 
not be. Thus, explicit definition of hypothetical 
unobservables by one or a set of operations is 
rejected. The bootstrap quality of the de- 
velopment of both constructs and their meas- 
ures is acknowledged in this logic. 

Instrumentality theories, as was mentioned 
before, suggest that behavior is in part caused 
by (a) one’s estimate of the probability of the 
behavior leading to some outcomes and (b) the 
evaluation of these outcomes, Both components 
of these theories refer to internal processes, and 
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therefore any attempt to support this idea must 
use a logic that allows the investigator to sup- 
port both observable and unobservable vari- 
ables as causes of behavior. We do not mean 
to imply that all the authors reviewed in the 
study are proponents of this logic, but rather 
that this logic is both necessary and sufficient 
for supporting these theories. 

While it is clear that this logic has implica- 
tions for the controversy concerning the status 
of cognitive concepts in learning theory, 
implications for other areas of psychology m 
not be as clear. Our review Shows that in- 
Strumentality theories have been used more 
extensively in the social and industrial areas of 
psychology than in learning. This is partly due 
to the fact that the behaviorist eschewal of 
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sJope 
Dulany (1961, 1962, 1968) has devel P, 
a theory that includes instrumentalittes. in 
theory is intended to account for pehn tio 
Verbal conditioning and concept wem. i- 
experiments. The theory specifies the aes ft 
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is concerned with the immediate pot wi 
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RHd = a “hypothesis of the distribution of 
reinforcement”; that is, the sub- 
ject’s hypothesis that the occurrence 
of the particular response will lead 
to a certain event or class of events; 

RSv = the subjective value of the rein- 

forcement; that is, the subject’s 

evaluations of (or attitudes toward) 
those events; 

The subject’s “behavioral hypothe- 

sis"; that is, his belief as to what he 

is expected to do, or what he should 
do in the situation; 

MC = the subject's “motivation to com- 
ply"; that is, how much the sub- 
ject wants to do what he believes is 
expected of him; 


BH 


ll 


y and w = beta weights, which may take 
values between — 1.00 and + 1.00. 


This theory says that a subject’s behavioral 
intention is determined by an instrumentality 
Component and a social component. The 
instrumentality component is (RHd)(RSv). 

| RHd is the subject’s hypothesis about the dis- 
tribution of reinforcement; that is, his percep- 
tion of the relationship between his response 


and the consequences of the response. It is 
multiplied by RSv, the subject's evaluation 
9f the consequences. The second component 
deals with the normative or social influences on 
the subject’s behavioral intention: (BH)(MC). 

H is the subject’s behavioral hypothesis, 
What he thinks he is expected to do in the 
Situation, This term is multiplied by MC, which 
|. |S the subject's motivation to comply with 
What he thinks he is expected to do. 

The subject's actual response is caused in 
Dart by his behavioral intention (BI). As 
Dulany uses the term, BI refers to a specific 
Intention to engage in a specific act in a given 
Situation. Other investigators have used this 
term to refer to general response sets, which 
may be evoked in a variety of situations 
(Triandis, 1964), : 

According to Dulany's theory, situational, 
Personality, and attitude variables affect be- 

‘vior and behavioral intention only through 

€r effect on the social and normative com- 

Ponents of the theory. For example, rein- 
| 


Orcement contingencies affect the subject's 
Ypothesis about the distribution of rein- 
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forcement (RHd) and his behavioral hypothe- 
sis (BH). These in turn affect behavioral 
intention in the manner specified above. 

In practice, Dulany used multiple linear re- 
gression to estimate the weights for the two 
components. This means that the contribu- 
tions of the two components to the prediction 
of behavioral intention and behavior can vary 
over situations, behaviors, and individuals. The 
weights are mathematically determined, de- 
pending upon the best empirical fit to predict 
behavior. For example, in situations of high 
group salience, the normative component may 
account for most of the variance in behavioral 
intention and behavior. 


Empirical Support 


Two studies (Dulany;* Dulany, Schwartz, & 
Walker‘) provide support for this theory. In 
the first study, Dulany related experimental 
manipulations and the components of his 
theory to behavior in a verbal conditioning 
experiment. Female subjects were seated in a 
12 X 7 foot chamber that was kept at a con- 
stant temperature of 110 degrees centigrade 
(C.) and a relative humidity of 3595. The 
subject was told that she would perform a 
verbal task under simulated climatic condi- 
tions. The subject's head was kept stationary 
with a headrest so that a stream of air from a 
nearby tube could always arrive at the side of 
her face. The task consisted of choosing and 
reading one of two sentences printed on a card, 
for 100 cards. 

For example, one card contained the following: 


'The frog suddenly hopped along a slick log and dis- 
appeared into a pond. 

The frog quickly hopped over the slippery log and 
vanished into a pond. 


Another example is: 


The frog quickly hopped along the slick log and dis- 
appeared into the pool. 

The frog suddenly hopped over a slippery log and 
vanished into the pond 


3 Dulany, D. E. The separable effects of the informa- 
tion and effect conveyed by a reinforcer. Paper pre- 
sented at the meeting of Psychonomic Society, St. Louis, 
October 1964. 

5 Dulany, D. E., Schwartz, S., and Walker, C. Why 
the affective and distributional parameters of rein- 
forcement interact. Paper presented at the meeting of 
the Psychonomic Society, Chicago, October 1965. 
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In all cases, a stream of air followed. the 
reading of sentences containing the articles 
“a” before “slippery” or “slick.” Since each of 
the other words in these sentences appeared 
with the word “a” with a certain frequency, 
the probability of obtaining a blast of air for 
using each word could be specified. Table 1 
presents the probability of obtaining a blast 
of air that was associated with the use of each 
of the words when taken over all sentences, 

The design was a 3 X 3 factorial correspond- 
ing to three levels of reinforcement value and 
three conditions of instruction about the sig- 
nificance of the reinforcement. There were 12 
subjects in each of the resulting nine cells. The 
air blast associated with the use of “a” could 
be 40 degrees cooler than chamber tempera- 
ture, at chamber temperature, or 40 degrees 
warmer than chamber temperature. That 
these reinforcements differed in value was sub- 
stantiated by ratings of pilot subjects. On an 
11-point bipolar scale that ranged from + 1 
(pleasant) to — 1 (unpleasant), mean values 
were .81, — .02, and —.84 for the three condi- 
tions of reinforcement. The three instruction 
conditions consisted of telling subjects that the 
blast of air (a) meant that they used the sen- 
tence correctly, (b) had nothing to do with the 
sentence they chose, or (c) meant that they 
had chosen the wron g sentence, į 
} The behavior to be predicted was the sub- 
Jects responses on the last 20 of 100 trials. 
theory, the experi- 
> experiment should 
ough their influence 


TABLE 1 


PROBABILITY or REcktivING Brast or AIR ASSOCIATED 


WITH WORDS IN THE DuraNv EXPERIMENT 
Attributes Probability value 
a 
suddenly ý m 
along 8 
slippery A 
pool 6 
disappeared 5 
vanished 5 
pond 4 
slick | 3 
over 2 
quickly 1 
the 0.0 


TERENCE R. MITCHELL AND ANTHONY BIGLAN 


on the two components of the theory and i, 
influence of these components on behavio ns 
intention. This means that experimental ue 
ditions should not be related to final pee 
ance when the propositional control varia 
are partialed out. da e 
After completion of 100 trials, subjects M. 
led from the room and given a questione s. 
to assess their awareness over the last 20 be 
Subjects’ behavioral intention as well ‘with 
other parts of the equation were measure dm 
a questionnaire. Thus, subjects’ hypot tf 
about the distribution of the reinforcer (Rsv) 
and their evaluation of the reinforcer nr 
Were assessed for the first component © they 
equation, and what subjects thought their 
Should do in the situation (BH), and tioy 
desire to comply with what they thought we 
should do (MC) were measured. pe DE 
taken to present questionnaire comp 
as an essential part of the experiment. were 
Two experimenters were used; subjects | the 
tested from early winter to late spring, aP were 
positions of questions in the questionnaire nce 
Scrambled. None of these factors influe 
the variables in the equation. , The 
The results support Dulany's theory: ip 
multiple correlation between behavioral 
tention and the two products (RHd X pe 
and (BH X MC) was .88. The beta WS re- 
for the two predictors were .19 and . ation 
Spectively, indicating that in this pu are 
subjects’ hypotheses about what the) eter” 
supposed to do are the more important C5. 
minant of their behavioral intentions. iig, 
viously, for this approach to have any ith 
behavioral intention must be correlatet d 
behavior. The correlation in this case bs the 
The results of this study underscor ig 
importance of cognitive variables in accou? on 
for the behavior of human subjects. ation? 
together, the two experimental manip actu! 
accounted for 36% of the variance 1D, rt re 
behavior. This is in contrast to the 8876 r bY 
Sponse variance, which was accounte : 
behavioral intention, c cott een 
We suggested above that if Dulany P pe 
that experimental manipulations influ vis L e 
havior through their effect on coent E. 
the relationship between manipulate” eva? 
behavior should be zero when pue du 
cognitive terms are partialed out. This 
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fact, the case; experimental manipulations were 
not significantly related to behavior when BI 
and RHd were “at zero or partialed out.” 

Although these results support Dulany's 
theory, they do not strongly support the in- 
strumentality hypothesis, namely, that a 
person's behavior is the result of its perceived 
consequences and the evaluation of those con- 
Sequences. A multiple correlation between the 
components of the theory and behavior was 
not reported. However, in predicting be- 
havioral intention, the beta weight for the 
instrumentality component (RHd)(RSv) was 
only .19, and the correlation coefficient. was 
440. These data indicate that the perceived 
instrumental relations in this situation were 
not substantially related to behavioral in- 
tention, The beta weight of .81 for the norma- 
tive component (BH)(MC) showed that the 
subjects’ perceived expectations were a more 
potent influence on his intentions than were 
the instrumentalities. 

Dulany et al. (see Footnote 4) conducted a 
Study similar to the one just described in order 
lo see if the instrumentality component was 
more highly related to behavioral intention and 
behavior when subjects! behavioral hypotheses 
Were fixed at zero. Fixing BH at zero fixes the 
(BH)(MC) component at zero and brings the 
contribution of this component to the predic- 
tion of BI to zero. Relevant experimental pro- 
Cedures were the same as those described above, 
except that all subjects were instructed that 
the air blast had nothing to do with the 
Sentence that they chose to read. For subjects 
Whose behavioral hypothesis was at zero, the 
Correlation between (RHd)(RSv) and BI was 
-72. The correlation between BI and behavior 
Was .90 in this study. 


Discussion 


Taken together, these studies provide sub- 
Stantial support for Dulany’s theory, but only 
Partial support for the basic instrumentality 
Conception. It appears that in many situations, 
normative or social factors are more important 
9r the subjects’ intentions and behavior than 
are the subjects’ perceived instrumentalities. 

Dulany’s use of multiple regression presents 
two problems, First, its use in the prediction 
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of BI and behavior from the theory of pro- 
positional control demands that all subjects 
weigh the two factors (RHd)(RSv) and 
(BH)(MC) in the same way. That is, the 
empirically determined weights'are the mathe- 
matical best fit for a set of subjects. Thus, 
prediction of behavioral intention and be- 
havior will be attenuated to the extent that 
different people weigh these factors dif- 
ferently. A second problem with the use of 
multiple regression is that the weights have no 
psychological meaning. The theory could be 
improved by developing a weighting system 
that reflects the subjects’ own method of 
weighting components. 

Finally, the attempt to generalize this theory 
to less specific situations may prove to be dif- 
ficult. That is, one of the major problems with 
the above approach is that the components of 
the equation change over time. In the studies 
mentioned above, the experimenter is asking 
subjects to recall information about what they 
thought over the last 20 of 100 trials. In more 
general situations where one is attempting to 
predict, for example, work behavior, one may 
find that stable estimates of the variables of 
Dulany’s theory are hard to derive. 


ATTITUDES 


Instrumentality theories have been useful 
in the attitude area in two ways. First, they 
have suggested a way of conceptualizing at- 
titudes. Second, they have contributed to the 
prediction of behavior from attitudes. 

Peak (1955) first pointed out that attitudes 
toward any object are dependent upon the 
object’s utility for attaining goals. More 
specifically, she stated that attitudes are re- 
lated to (a) their instrumentality for aiding 
or interfering with goal attainment and (6) 
the satisfaction derived from reaching those 
goals. Instrumentality is defined as the per- 
ceived relationship between the attitude object 
and the attainment of goals. 

A test of this attitude theory was provided 
by Rosenberg (1956). He hypothesized that 
attitude is related to the algebraic sum of the 
products obtained by multiplying (a) the 
importance of each of a set of values by (b) 
the instrumentality of the attitude object for 
achieving or blocking realization of each value. 
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Symbolically, this hypothesis may be stated: 


where 
A = attitude; 
I; = the rated instrumentality of the at- 


titude object for attaining or blocking 
each of a set of values; 
and 


=< 
I 


= the rated importance of the value; 
number of values. 


Subjects’ attitudes toward the issue of 
allowing members of the Communist Party 
to address the public were assessed by means 
of a standard attitude questionnaire. This 
questionnaire was of the Likert type (Likert, 
1932). Test-retest reliability over a 2-month 
period was reported to be .72. Three to 5 weeks 
after completion of the Likert attitude scale, 
subjects rated the instrumentality of allowing 
Communists to speak for attaining each of 35 
personal values and the importance to them 
of these values as a source of satisfaction. Each 
value could be rated from +10 to —10 on the 
importance dimension. The instrumental rela- 
tionship between letting Communists speak 
and each value could be rated from T 5to—5. 
The sum of the products over the 35 values was 
significantly related to the original measure of 
attitude, X? = 26.33 (p < .001). A second 
Score was computed in the same manner for 
values that were rated by the subjects to be 
Most salient. This score was even more strongly 
related to the original attitude measure 
(X = 45.09; p < .001). This result suggests 
that attitudes may depend on only a small 
number of values that are important to the 
individual. Using a standard set of questions 
for all subjects may be adding superfluous or 
even confusing information. 

Rosenberg (1956) reported successful replica- 
tion of these results for the attitude “allowing 
Negroes to move into white neighborhoods.” 
He did not report the details of the study, 
except to say that it was based on "identical 
measurement operations” and a 
analysis, 

Fishbein (1963, 1965b, 1967a, 1967b) has 
also argued that instrumentalities are related 


parallel 


TERENCE R. MITCHELL AND ANTHONY BIGLAN 


" side 
to attitudes. The formula for his attitud 
theory is expressed as follows: 


n 
A, =2 Bai, 
ied 
where 
A, — the attitude toward the object 2 "m. 
B; — the strength of belief 7 about 0; o 
is, the “probability” or feng se 
ability” that o is associated with $ 
other concept x; ; m 
a; = evaluative aspect of B;; that an 3 
evaluation of the related cone a 
n = the number of beliefs about 0; mw d 
the number of responses 1n us 
dividual's habit-family hierarchy. 
an 
; : n 8 
Fishbein based his theory partly reraent 
analysis of existing attitude measu b 


z t 

techniques. According to him (19659 
most frequently employed techniques ieren 
stone Scale, Likert Scale, Semantic D! e as 
tial) measure subjects’ beliefs about px her 
sociation between the attitude object e 4 
objects, events, or values; and their eva U^. 
of these objects, events, or values. 

ample, on the semantic differential, su p. 
are asked to rate the attitude objec.. du 
series of 7-point (— 3 to + 3) bipolar ac} ised 
scales. The adjectives that are typica y Tan 
were determined by Osgood, Suci, an on an 
nenbaum (1957) to be heavily loaded, a ive 
evaluative dimension, so that one ay ud? 
reflects positive evaluation of the A e 
object and the other negative uar sub* 
good-bad). According to Fishbein, t icit 
ject’s check on the 7-point scale 1 "m 
whether he thinks the attitude a (ai) 
sociated with a positive or negative sta soci 
and his belief about the strength of 25 neck 
tion (Bj). Thus, when the subject vatin 
+2 on the good-bad scale, he is 1? a mU 
that the attitude object is associated ini 
positive state and that the strength of m 
sociation is 2. The subject's total score E a) 
instrument is conceptualized as the sur afi tbe 
his beliefs about the relationship betwee eit 
attitude object and certain positive OT, , pj 


thi? 


or 1 
ibj ect? 


TL O T J 


; iplied 
live states (a; = +1 or — 1) multip qe / 


(b) his perception of the strength E m 
relationships (B; = 0 to 3) (M. A. Fi 
personal communication, June 1969). 


iM 
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To provide support for his theory, Fishbein 
(1965b) has compared subject scores on 
Semantic differential scales with scores that 
were directly based on the subjects’ perceptions 
of the instrumental relations between the 
attitude object and other objects. He had 125 
subjects list what they believed to be five 
characteristics of Negroes. The 10 most fre- 
quently cited characteristics were selected, and 
belief statements were constructed for each 
one (e.g., Negroes have dark skin), Two weeks 
later, 50 subjects rated each characteristic on 
five bipolar evaluative adjective scales (e.g., 
Eood-bad), and each belief statement on five 
Probabilistic bipolar scales (e.g. probable- 
unprobable). Each subject also rated the 
Concept "Negro" on the five bipolar evaluative 


n 
Scales. The correlation between > B;a; and 
i=l 
the direct evaluation of “Negro” was .801 
(p < .001). Fishbein reported similar support 
for leaders’ attitudes about group members 
(Fishbein, 1965a), and voter’s attitudes about 
political candidates (Fishbein & Feldman).* 
Thus, both Rosenberg’s and Fishbein’s re- 
Search provide good evidence that attitudes 
depend on the perceived instrumental rela- 
tions between the attitude object and other 
objects or values and the evaluation of these 
related objects or values. 


Altitudes and Behaviors 


Fishbein (1967b) also has developed a 
theory of the attitudinal and normative 
antecedents of behavioral intention and be- 
havior. Attempts to predict specific behavior 
from attitudes have typically led to non- 
Supportive results (LaPiere, 1934, is perhaps 
the most famous example). Fishbein (1967b) 
pointed out that there are at least two reasons 
Why this should be so. First, we have often 
measured attitude toward a general stimulus 
Object or class of objects rather than a specific 
Object. Second, the behavior predicted may be 
9nly partially related to attitude. That is, 


s : Fishbein, M., and Feldman, S. Social psychological 
tudies in voting behavior: I. Theoretical and methodo- 
gical considerations. Paper presented at the annual 
Meeting of the American Psychological Association 
‘ptember 1963, 
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people may have the same amount of affect 
toward an object, and yet behave differently 
toward it (Doob, 1947). Because of these con- 
siderations, Fishbein's theory assesses specific 
attitudes about specific behaviors, and also 
considers normative influences on behavior. 

Fishbein's theory is essentially an adapta- 
tion of Dulany's theory of propositional con- 
trol. The basic equation is: 


B= BL=[ 3 Bawo + [(NB)(MC) Je, 


i=l 
where 


B = behavior; 

BI = intention to make a particular 
response; 

B; = beliefs about the consequences of 
performing a particular behavior 
(in a given situation); that is, the 
probability or improbability that 
the performance of behavior X will 
lead to some consequence y;; 

a; = the evaluative aspect of B;; that is, 
the subject’s evaluation of y; 
the consequences; 

NB = a normative belief; that is, the sub- 
ject's expectation about what he 
should do in this situation; 

MC = his desire or lack of desire to do 
what he thinks he should do; that 
is, his motivation to comply with 
the norm; 

n = number of beliefs; 

10920, = empirically determined weights. 


However, as stated earlier, Fishbein considered 
that 


A, = Z Bias, 
1 


where A, = the attitude toward the subject o, 
and since, in this case, o is a specific act, he 
considers the following to be an equivalent 
formulation of his theory of behavior; 


B = BI = [Aner Jo + [(NB)(MC) Jen, 


where Ages = attitude toward the act or 
specific behavior. Thus, the first component 
(Acct) is conceptualized as the instrumentality 
part of the equation. In the three studies re- 
viewed in the following section, three different 
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techniques used to this 


component: 


were measure 


1. One scale assessing whether the act leads 
to “good or bad consequences," and one scale 
for the subjects’ confidence or certainty of this 
relationship. 

2. Four bipolar adjective scales as an 

n 


estimate of È B;a;. 


1 
3. Scales assessing the subjective prob- 
abilities of two choices leading to each of the 
four outcomes and the utility of each outcome 
in a prisoner's dilemma game. 


To justify the interchangeability of these 
different measures, it should be shown that 
they are highly correlated and that they are 
related to other variables in the theory in the 
same way. Support for the convergence of the 
bipolar scales and measures of subjects’ 
perceived | instrumentalities was reviewed 
above. In one of the studies reviewed in the 
next section (Ajzen & Fishbein, 1969), the 
authors reported that two different measures 
of the instrumentality component (Measures 
2 and 3) were related to other variables of 
Fishbein's theory in the same manner. How- 
ever, they only present the relevant relation- 
Ships between the bipolar adjective measure 
and the other variables of the theory. 

The second component of Fishbein's theory 
(NB)(MC) is essentially the same as that used 
by Dulany. Fishbein's NB and Dulany's BH 
both refer to the subject's belief about what 
he should do in a given situation. MC stands 
for the subjects motivation to comply in both 
these theories. 


Empirical Support 


Fishbein (personal communication, June 
1969) first used this theory to predict the oc- 
currence of premarital intercourse for a group 
of 14 female and 21 male students. In this 
study, the instrumentality component (Ac) 
of the theory was measured with two questions. 
The first asked subjects whether they felt that 
sexual intercourse led to good or bad con- 
Sequences; the second asked them to rate their 
certainty about the relationship between 
sexual intercourse and its consequences. The 
Second component of Fishbein's theory was 
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measured by asking subjects to indicate o 
they believed they should do (NB) and e 
motivation to comply with what they € 
they should do (MC). Finally, subjects 
havioral intentions were assessed. At the m. 
of the semester, subjects were asked Mee 
they had engaged in premarital sexual in na 
course. To preserve anonymity, ubl 
identified themselves only by indicating t^ 
mother's maiden name. 

Using the two components of t 
Fishbein computed the multiple co 
for predicting actual behavior and b a 
intentions. These results are presente ici 
Table 2. They indicate that the theory lupe i 
females’ intentions and behavior very - dy 
The behavior of males was not signife i 
related to the components of € 
theoretical equation, although the inte wn. 
were. The instrumentality componit, P ioral 
a significant role in the subjects’ be 5) with 
intention; Aset correlated .52 (p < - ales 
males’ BI, and .92 (p < .05) with sem he 
BI. Also, A44 correlated .72 (p < 05) W? xm , 
females’ behavior, and .26 (ns) with the m 
behavior. Since the number of subjects 
small, and the behavior to be predicted i 
dichotomous variable, these results coms 
only fair support for the theory. . 

One problem pointed out by this stu = 
that dyadic situations have different Pi is DY 
ters than situations in which the subjec (pt 
himself. When the behavior in quesse js 
marital sexual intercourse, in this [e m 
contingent upon the actions of aro d 
tentions may be only partially re 
actual behavior. 

A second study by Ajzen? use ; 
theory to predict communication r grouP 
pliance in 48 three-person groups. Bu SA ar 
member was seated at the corner of a t? d 
board whose corners could be raised or xositio? 
by the subject. Information about the P^ nat 


e «++ levels ^ d 
of the board was provided by spirit iste 
I k con?" red 


he theory: 
rrelations 
ehaviora 
in 


dy was 


t 15 
wehbe 
d Fis! om- 


were placed at each corner. The tas* eventet 
of balancing the board. Partitions Pey e 
subjects from seeing each other, but ag 


H sao 
allowed to communicate with written ne of 
; A ; iefs, and the o" 
* Ajzen, I. Attitudes, normative beliefs, ¢ ation: “is, 
diction of behavior: An empirical investiÉ quine 
published master’s thesis, University o: 
Urbana, 1967, 
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TABLE 2 


PREDICTION OF PREMARITAL SEXUAL INTERCOURSE 


Multiple R using 


orrelation with A; 
both components Corre x 


Sex 
BI | B | BI | B 
Females | .93* | .83* 92 | 2 
Males .86* | .29 ,52* | .26 
Note.—B = behavior, BI = behavioral intention. 
t <05 


Each subject could raise or lower his corner one 
step, and communicate with one other subject 
in a given trial (i.e., instruct him to raise or 
lower his corner). The behavior to be predicted 
was the number of instructions sent by each 
subject to Member 1 (on his left) and Member 
2 (on his right), as well as the number of times 
he complied with messages sent by the other 
two members for 20 experimental trials. 

Subjects completed 10 trials in order to 
acquaint themselves with the procedure. A 
pretest questionnaire was then administered. 
Twenty more trials were then completed, and 
a final posttest questionnaire distributed. The 
questions on the pretest asked subjects about 
their intended behavior on trials that were to 
follow. The instruction for the posttest re- 
quired the subjects to answer the question as 
if they were going to participate in the ex- 
Periment again. Both the pretest and posttest 
gathered the following information: 


1. Asa. Attitudes toward the two acts to be 
Predicted. These attitudes were measured on 
four bipolar adjective scales. 

2. NB. Perceived expectations of the other 
two members with regard to the two acts. 

3. MC. Motivation to comply with the ex- 
bectations of the other two members. 

4. BI. Behavioral intentions with regard to 
the two acts. 


, The multiple correlation coefficients for pre- 
licting BI from Act and NB were generated 

oth for the pre- and posttest questionnaires to 
Predict both the number of instructions sent 
and the number of times the subject complied 
En. his two other partners. For these multiple 
ma lations, NV = 144. Note that Age was not 
Nod by obtaining the probability of the 

5 leading to a set of outcomes and the 
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TABLE 3 


MULTIPLE CORRELATION COEFFICIENTS FOR PRE- 
DICTING BI FROM Ast AND NB IN BALANCE 
Board Stupy (N = 144) 


Prequestionnaire Postquestionnaire 
Measure 
Member 1 | Member2 | Member 1 | Member 2 
Instructions | .692* .720* .196* -910* 
Compliance | .647* -702* -898* .839* 


Note.—BI = behavioral intention, Aact = attitude toward 
the act or specific behavior, NB — normative belief. 
*p «.0l. 


evaluation of these actions. We discussed in 
the previous section the problems involved in 
substituting a standard measure for more 
direct measures of subjects’ perceived in- 
strumentalities. Also note that motivation to 
comply (MC) was omitted from these analyses. 
The author reported this variable did not in- 
crease, and often attenuated the multiple cor- 
relation coefficients and was therefore dropped 
from the analysis. The coefficients for predict- 
ing BI from A44 and NB are presented in 
Table 3. 

"These results show that there are substantial 
relationships between some of the variables 
of the theory and subjects’ behavioral in- 
tention. However, since MC was dropped from 
the analysis, the results cannot be considered 
as good support for Fishbein's theory. Indeed, 
to the extent that the inclusion of MC at- 
tenuated the multiple correlations, the theory 
is disconfirmed. Unfortunately, Ajzen (see 
Footnote 6) did not report the multiple cor- 
relations that were obtained when MC was 
included. 

Correlations between Ase and BI are pre- 
sented in Table 4. These results support the 


TABLE + 


CORRELATIONS OF BI WITH Anet IN BALANCE 
Board Struny (N = 144) 


Prequestionnaire | Postquestionnaire 
Measure |————— e 
| Member 1 | Member 2 | Member 1 | Member 2 
Instructions | .409* .568* .524* .802* 
Compliance .605* .652* S12* :187* 
Note.—BI = behavioral intention, Aae = attitude toward 


the act or specific behavior. 
+p «.0l. 
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TABLE 5 


CORRELATIONS BETWEEN SPECIFIC BI AND B IN 
Baraxce Board Srupy (N = 144) 


Prequestionnaire Postquestionnaire 


Measure 
Member 1 | Member2 | Member 1 Member 2 
Instructions | -690* .690* .822* .883* 
Compliance | .126* .246* -446* .526* 


Note.—B = behavior, BI = behavioral intention. 

*p <.01. 
contention that attitudes toward Specific be- 
haviors are related to behavioral intention. 
And, if Fishbein is correct that attitudes in- 
volve instrumental relations, these results may 
be taken as support for the instrumentality 
hypothesis. Correlations between Aset and be- 
havior were not reported. 

The correlations between specific behavioral 
intentions (BI) and actual behavior are pre- 
sented in Table 5. The variance accounted for 
in compliance behavior is low for both the 
pre- and postquestionnaires (0% to 25%). The 
authors suggested that one problem with the 
compliance measure was that some subjects 
received very few or no Opportunities to com- 
ply. A subject could not comply, even if he 
wanted to, unless he received an instruction, 
Therefore, a second analysis examined the 
compliance behavior just toward the member 
who sent the greatest number of instructions. 
The correlations between BI and B for the 
pre- and postquestionnaires were increased 
to .242 and .582 (V = 144), respectively. 
Even though both correlations are significant 


(p< 01), these modifications still leave large 
amounts of variance unaccounted for. 
Player 2 
Cy D: 
Cı | +9, +9 | —10, +10 
Player 1 
D: | +10, —10| —9, —9 
Game 1 


The first number in each cell indicates the 
points which the first player receives, and the 
second number indicates the points for the 
second player. For example, in Game 1, if 
Player 1 chooses Response C, and Player 2 
chooses Response D, then Player 1 loses 10 
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The difference between pre- and pore 
questionnaire responses suggests an addition? 
problem for the theories reviewed in Pa 
study. It is clear from the data that the poa 
test responses did a better job of predicting B i 
and B than did the pretest responses. A € 
ber of possible reasons for this discrepancy Sd 
presented by the author. First, an individua 
BI may changé during the course of the 6 
periment. In other words, the behavior al 
others may change what he intends to 3 
Second, the subject's competence on the F 
may change his intention, and, third, the M. 
ject may attempt to report his Sent t 
the posttest as more in line with gha E 
actually did—a “cognitive bolstering p. 
cess. If cognitive bolstering is — =r ate 
it is unlikely that the cognitions tha te 
measured in this theory could be leer 
havior. In any case, it appears that m 
the components of the theory are not we 
over time. To the degree that this is thé uri) 
the applicability of the theory is, of © 
limited. i 9 

More recently, Ajzen and Fishbein as 
examined the ability of the theory to a the 
behavior in prisoner’s dilemma games. 
two games used by Ajzen and Fishbein sing 
of the two players accrue points by € each 
between two alternative responses. f point? 
game, a matrix indicated the number " obtain 
on each trial which each player could arto? 1 
depending on his response and his P'hoost? > 
response. More specifically, each play ie choice 
C or D and the combination of these 
determines a cell in the matrix : 


Ch Dy 


& | +4, +1 | —12, +12 


—4, ~i 


dites ur A 
à : jte 
points, and Player 2 gains 10 points. A. ant 
tive C is defined as a cooperative ri A 0s 0 
D as a competitive one. If a player 16e 23 ie 
in Game 1, he stands to gain + 9 or se. f 9. 
depending on his partner's ee 5e 
chooses D, he stands to gain + 10 o 
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However, only if both subjects choose C can 
they both win points on a given trial 
(cooperation). 

The matrices for the two games dilfer in the 
Way that payoffs are distributed; the matrix 
for the first game provides far less pressure 
toward competition between plavers. For a 
more detailed description of these games, see 
Rapoport and Chammah (1965). 

Subjects were 96 undergraduates. Each 
Subject played both games. The first game 
Was played for eight trials and was followed 
by the administration of a questionnaire. 
After completing the questionnaire, 10 more 
trials of the first game were played. The same 
Process was repeated for the second game. The 
number of cooperative moves on the last 10 
trials of each game was the measure of behavior. 

The questionnaire assessed each of the 
following: 


1. Ase, the attitude toward cooperation. 
This was measured on four bipolar adjective 
Scales, 

2. Subjective conditional probabilities (SP), 


« the subject's perception of the likelihood of 


Is partner choosing each alternative given 
fach of the choices he could make, for ex- 
ample, the probability of his partner choosing 

; given that he chose C. 

3. Payoff utilities (U). An evaluative meas- 
Ure of each payoff. 

4. Normative beliefs (NB). A measure of 
What is perceived to be expected by one's 
Partner. 

5. Motivation to comply (MC). A measure 
9f the subject’s willingness to comply with the 
berceived expectations of his partner. 


_ It will be recalled that the theoretical equa- 
tion may be written: 


B ~ BI = [Ant Jeo + [(NB)(MO) Jor 


and that Fishbein considers Ase to be the in- 
Strumentality: component, ‘This test of the 
Cory employs two measures of the first 
a Ponent: an estimate of Aget in the form of 
a bipolar scales and a measure of the sub- 
inve expected utility (SEU) of cooperation. 
s was computed by summing the products 
ire subjective probability (SP) of each act 
es its utility (U): SEU = ESP,U, The 
“asure of Age and SEU correlated .63 for 
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the first game and .67 for the second. These 
relationships do not appear to be large enough 
to justify the conclusion that both measures 
reflect the same construct. We indicated above, 
however, that these variables had similar re- 
lationships to other variables of the theory. 

As in the study of the triangle board, MC 
was dropped from the analysis because it 
lowered the predictive ability of the equation. 
Thus, NB was the only variable included in 
the normative component of the theory. 

Theauthors reported correlations between 
BI and B of .85 and .84 for Games 1 and 2, 
respectively. The multiple correlations for 
predicting BI from Asc, and NB were .73 and 
-79 for the two games. The correlations between 
Aact and B were .63 and .70. Table 6 presents 
these results. 

These results are consistent with those re- 
ported in earlier studies. However, since the 
motivation to comply variable was dropped 
from the analysis, these results do not con- 
stitute support for Fishbein's original theory. 
lt appears that the normative component of 
the theory needs revision. Second, the authors 
reported SEU to be more highly related to 
actual behavior than A44 in both Games 1 
and 2 (.70 to .63 in Game 1, and .75 to .70 in 
Game 2), and yet the Age estimate of the first 
component is used throughout their report of 
the study. If attitude toward an act is indeed 
determined by the probability of the act lead- 
ing to certain outcomes and an evaluation 
of those outcomes, why not use this variable 
instead of an estimate? The authors argued 
that Asc should be a better estimate because 
it takes into account a broader number of 
possible consequences of the given act. How- 


TABLE 6 


PREDICTION OF COOPERATIVE RESPONSES 
IN THE Prisoner's DILEMMA GAME 


Correlation | Game 1 | Game 2 
Correlation of Aast and BI | .15* | Jas 
Multiple R for BI | .89* .85* 
Correlation of Aast and B .63* | .70* 
Correlation of SEU and B .70* q5* 
Multiple & for B | .73* .79* 

Note Aia = attitude toward the act or specific behavior, 
BI = behavioral intention, B = behavior, SEU = subjective 
expected utility. 

*p <.01. 
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ever, in the above study, SEU was at least as 
strongly related to B as was Aact. The correla- 
tion between BI and SEU is not reported. 
These discrepancies require clarification. 


Discussion 


Instrumentality theories have been used in 
attitude research in two ways: First, attitude 
itself has been defined as the probability of the 
attitude object leading to various outcomes or 
values and the evaluation of these outcomes. 
Both Fishbein (1965b) and Rosenberg (1956) 
presented good support for this hypothesis. 
Second, Fishbein has included the above 
theory of attitude in a more complex theory 
designed to predict social behavior. This latter 
theory, which includes both attitudinal (in- 
Strumentality) and normative predictors of 
behavior, has received moderate support. 

One point brought out by tests of this theory 
is that social behavior is often dependent upon 
other people's behavior. What one does in 
social situations is contingent upon what 
others are doing or might do. Only in the last 
Study reviewed (Ajzen & Fishbein, 1969) was 
àn attempt made to measure this interdepen- 
dence (i.e, the subjective conditional prob- 
abilities), and this measure was then dropped 
from the analysis. 

The normative component also presented 
problems. The motivation to comply variable 
did not interact with normative beliefs, as the 
theory predicted that it should. The fault may 
be with the conceptualization or the measure- 
ment of NB and MC, or both. 

In Tables 2, 3, 4, and 6, the multiple cor- 
relations using both components to predict BT 
or B can be compared to the zero-order cor- 
relations between the instrumentality com- 
ponent and BI or B. All the correlations 
between BI or B and the instrumentality com- 
ponent are significant except for the correla- 
tion with males’ behavior in the study predict- 
ing premarital sexual intercourse. Also, the 
differences between these multiple correlations 
and zero-order coefficients are relatively small, 
indicating that in these situations, the in- 
strumentality component controls a substantial 
amount of the predictable variance. 
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Investigators in industrial psychology have 
been less reluctant to employ constructs thag 
refer to internal, organismic processes. Ac 
cordingly, instrumentality theory has been 
used extensively in this area. 

Georgopoulos Mahoney, and Jones (1957) 

, y, : 
first tested an instrumentality hypothesis E 
the industrial area. They suggested that wore 
productivity could be predicted from b. 
worker's perception of the degree to vius 
productivity is a path to the attainment a 
personal goals. If the worker sees high " 
ductivity as a path to his goals, then he b. 
be a high producer; if he sees low products 
as a path to these goals, then he will be 2 
producer. i 

Georgopoulos et al. tested this hypoth 
in a household appliances company. They 
tained measures of the perceived instru 
tality of high and low productivity for €2€^ |. 
10 goals. Their sample of 621 workers was b " 
into high- and low-need groups on the gs 
their ranking of three of these goal E. 
“making money in the long run”; "8€ ro- 
along well with the work group”; p 
motion to a higher base rate.” Thus, t! 
need group ranked these items as hi$ 3d 
importance to them than did the low gee 
group. The measure of productivity WS ve of 
on subject reports of the percentage pus had 
below the company-set standard they hich 
reached. This measure was the form mM S the 
productivity was typically dealt with ker 
plant; hence, it was not alien to the WO ow 
Finally, workers were split into high- arity {0 
freedom groups on the basis of their abit ney 
set their own work pace (if they could, © ge 
were in the high-freedom group), un and 
(20-59 years was high-freedom grou m 6 
their amount of experience (more 
months was high-freedom group). J 1 

The results of this study support the isl 
goal path or instrumentality hypothes 
high productivity was associated Wl. t0 
perception that high productivity ond, 
each of the three goals in question. Sec ort? — ud 
workers who ranked a goal as being s vee? i 
to them personally, the relationship an T 
productivity and perceived instrum eel 
was stronger, Third, the relationship 


esis 
ob- 
nen 


tting 4] 


INSTRUMENTALITY THEORIES 


perceived instrumentality and productivity 
was significantly higher for those who were 
classified as “free.” Finally, for workers who 
were both "free" and had a high need on an 
item, there was a stronger relationship be- 
tween perceived instrumentality and pro- 
ductivity than for workers with any other 
combination of these two factors. To sum- 
marize, the above study supports the hypothe- 
sis that productivity is related to its perceived 
instrumentality for the attainment of goals 
and that degree of attraction to the goals and 
freedom of action mediate this relationship. 

In a similar study, Porter and Lawler (1968) 
examined attitudes toward pay as they 
related to the performance of managers in 
industrial and governmental organizations. 
They compared the performance of the third 
of their sample that perceived pay as a prob- 
able outcome of performance with the third 
that saw little relation between performance 
and pay. Performance (as rated by the sub- 
jects and their superiors) was significantly 
higher for the former group. When samples 
were split according to the rated importance 
of pay to the subjects, the relationship be- 
tween performance and the expectancy that 
pay results from performance was stronger 
for the high-importance group than for the 
low-importance group. This suggests that 
performance is highest for those who perceive 
it as leading to valued outcomes and is con- 
sistent with the Georgopoulos et al. (1957) 
study. 

A more complete statement of instrumen- 
tality theory in industrial psychology was 
presented by Vroom. His statement of the 
theory has been the basis for most of the sub- 
sequent work in this area. Vroom (1964) pre- 
sented two models, the first for the prediction 
of the valences of outcomes, and the second 
for the prediction of force toward behavior. An 
Outcome is simply anything an individual 
might want to attain. The valence of an out- 
Come for a person is defined conceptually as 
the strength of his positive or negative affec- 
live orientation toward it. Similar to Lewin's 
use of the term, valence refers to the anticipated 
Satisfaction associated with an outcome, and 
15 distinguished from the value of the outcome— 
the actual satisfaction resulting from attain- 
ment of the outcome. 
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The valence model states that the valence 
of an outcome to a person is a monotonically 
increasing function of the algebraic sum of the 
products of the valences of all other outcomes 
and his conceptions of the specific outcome's 
instrumentality for the attainment of these 
other outcomes. Symbolically, 


V; =f = (Vig), 
k=l 


where 


V; = the valence of outcome 7; 

I; = the cognized instrumentality 
come j for the attainment 
come k; 

V, = valence of outcome k; 

n = the number of outcomes. 


of out- 
of out- 


Cognized or perceived instrumentality is 
defined conceptually by Vroom (1964) as the 
degree to which the person sees the outcome in 
question as leading to the attainment of other 
outcomes. Instrumentality varies from —1 
(meaning that the outcome in question is per- 
ceived as always leading to not attaining the 
second outcome) to + 1 (meaning that the out- 
come is perceived as always leading to the at- 
tainment of the second outcome). 

Although this model can be used to predict 
the valence of any outcome, it has been applied 
most frequently to the prediction of job 
satisfaction. In essence, the model says that 
the worker’s satisfaction with his job results 
from the instrumentality of the job for attain- 
ing other outcomes and the valence of those 
outcomes. In the remainder of this section we 
refer to this model as the job satisfaction model. 

Vroom's second model predicts the force 
toward behavior. The force on a person to per- 
form an act is conceptualized by Vroom as a 
monotonically increasing function of the 
algebraic sum of the products of the valences 
of all outcomes, and the strength of his ex- 
pectancies that the act will be followed by the 
attainment of these outcomes (Vroom, 1964). 
Symbolically, 

n 
F;- = (EV) 
j=1 


= 
where 


F; = the force on the individual to perform 
act i; 


B 
I 


= the strength of the expectancy that 
act i will be followed by outcome 7; 
the valence of outcome 7; 
n = the number of outcomes. 


E 
wA 
ll 


The individual's expectancy is defined by 
Vroom as his belief concerning the probability 
that the behavior in question will be followed 
by the outcome of interest. An expectancy is a 
perceived probability, and therefore ranges 
from zero to plus one. It is distinguished from 
instrumentality in that it is an action-outcome 
association, while instrumentality is an out- 
come-outcome association. While expectancies 
are perceived probabilities, instrumentalities 
are perceived correlations. 

Vroom suggested that this second model 
can be used to predict choice of occupation, 
remaining on the job, and effort. In practice, 
it has been tested with respect to job perform- 
ance, and we, therefore, refer to it as the job 
performance model. Specifically, Vroom stated 
that the force on the individual to exert a given 
amount of effort is a function of the algebraic 
sum of the products of the valence of each level 
of performance and the person's expectation 
that each level of performance will be attained 
by that amount of effort. Note that the amount 
of effort, not performance, is predicted by 
Vroom; effort is considered to be a behavior, 
while performance is an outcome. 

Galbraith and Cummings (1967) elaborated 
and tested Vroom's job performance model. 
They began by distinguishing between first- 
and second-level outcomes. A first-level out- 
come is one which has a valence that the in- 
vestigator is interested in predicting—in their 
case, performance on the job. They defined 
second-level outcomes as events to which the 
first-level outcomes are expected to lead. It 
should be noted that Galbraith and Cummings’ 
definition of second-level outcomes suggests 
that only those outcomes actually expected by 
the individual to result from the first-level out- 
come may be considered. In fact, outcomes not 
perceived by the individual as resulting from 
the first-level outcome may be included in the 
equation, since their instrumental value will 
be zero; and they will, therefore, not degrade 
the prediction of the valence of the first-level 
outcome, 


Galbraith and Cummings described three 
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sources of second-level outcomes that might | 
be expected to result from performance. ur 
the organization as a whole may bw 
money, fringe benefits, and promotions 1d 
performance. Second, the supervisor may ja 
supportive. Third, the work group may 3 
accepting. Essentially, Galbraith and Me 
mings are providing an a priori list of the ou : 
comes they feel are important to the predictio 1 
of the valence of high performance. The we 
lem with this approach is that the investigan 3 
may fail to support the instrumentality | 
hypothesis because they choose the ge 1 
level outcomes that are not important to m 
subjects, father than because of any ! 
adequacy of the hypothesis. ES | 

Galbraith and Cummings did not study b. 
effects of expectancy in this investigatio s 
Rather they attempted to predict UE i5 
for a sample of subjects considered like A 
have expectancies of + 1 about the em ird 
ship between their behavior and high per nti 
ance. Workers were chosen whose epo 
of performance did not depend on ° ds 
workers and whose work pace was not ng 
pendent on a machine. Their sample ad 
sisted of 32 workers in an Indiana ho ity 
equipment plant. Measures of instrumen wo 
of performance for each of the seconc 1 it 
outcomes and the valence of each second- "Thé 
outcome were obtained from subjects. 
measure of performance was obtaine' form 
company records. This measure took the Y 
of a percentage of the company standarc 
a 1-month period. e. 

The instrumentality hypothesis was w^ 
separately for each of the second-level be 
comes. Significant prediction of perform o 
was obtained only for the outcome, e 
tiveness of the supervisor. Neither the T 4 
of this outcome nor the instrumentality 
performance for the outcome was xelate 1 0 
performance by itself, but the interaction. 
the two was related to performance. odel | 
result is as Vroom's job performance P 
predicts. 

The instrumentality hypothesis was step” 
tested for all outcomes taken together. this 
wise multiple regression was used. m jn- 
analysis, the interaction of valence = per- 
strumentality was significantly related and 
formance for both the supportivenes? 


als? 


money outcomes. It should be noted that the 
analysis of these data did not include a test 
of the association between the su of the pro- 
ducts of instrumentality and valence ratings. 
This sum was central to Vroom's statement of 
the instrumentality hypothesis, but Galbraith 
and Cummings (1967) gave no explanation for 
the absence of this test. 

Despite the author's conclusions, we do not 
feel these results provide impressive support 
Tor instrumentality theory. The failure to ob- 
tain strong support of the instrumentality 
hypothesis may be due to a number of factors 
besides the inadequacy of the hypothesis. First, 
Subjects! expectancies that their behavior 
leads to high performance may not have been 
controlled at + 1 as the authors hoped. Ac- 
cording to Vroom's hypothesis, even when 
high performance is positively valent, the 
individual will not exert a given amount of 
effort if he does not expect his effort to lead to 
high performance. Thus, if subjects’ expect- 
ancies were not at + 1, the theory would not 
predict that the sum of the products of the 
valence of second-level outcomes multipled by 
the instrumentality of performance for those 
second-level outcomes would be related to 
performance. 

Second, the a priori set of outcomes used in 
this study may have excluded outcomes related 
to performance. For example, a feeling of ac- 
complishment may be an important outcome 
of achieving high performance for some in- 
dividuals. Negative outcomes, such as getting 
fired and being bored, may also affect per- 
formance. For example, some individuals may 
perceive many positively valued outcomes as 
resulting from performance, but also perceive 
boredom as a result. If boredom is strongly 
negatively valenced, it may lead the individual 
lo forego the possibility of attaining positive 
outcomes. 

A third reason for the failure of Galbraith 
and Cummings’ (1967) study to support the 
theory is that amount of effort and perform- 
nce may not have been related in this sample. 

call that Vroom's (1964) model predicts 
E not performance. Thus, if Galbraith 
Duos. sores had measured workers inten- 

ce O exert effort or actual effort exerted, 

Eun have found that the model 

ately predicted these variables. 


INSTRUMENTALITY THEORIES 


447 


Lawler and Porter (1967) did attempt to 
predict effort from subjects’ perceived in- 
strumentalities. They studied 154 managers in 
five different organizations, and ratings of the 
importance of seven outcomes were collected: 
pay, promotion, prestige, security, autonomy, 
friendship, and opportunity to use skills and 
abilities. A composite measure of the subject's 
expectancies was based on the sum of his rat- 
ings of the degree to which (a) effort, (6) high 
productivity, and (c) good job performance led 
to the seven outcomes. Ratings of effort and 
performance were obtained from superiors, 
peers, and the subject. Multiple correlations 
between the Expectancy X Importance ratings 
for each of the seven outcomes, and the 
supervisor, peer, and self-ratings of effort, were 
27 (p < .10), .30 (p < .05), and .44 (p < .01), 
respectively. Prediction of performance from 
the Expectancy X Outcome scores was less 
successful; the multiple correlations were 
18 (ns), .21 (ns), and .38 (p < .01) for the 
supervisor, peer, and self-performance ratings. 
With seven predictors, these multiple correla- 
tions are hardly strong support for the in- 
strumentality hypothesis. As in the Galbraith 
and Cummings (1967) study, Lawler and 
Porter’s use of multiple correlations is not con- 
sistent with Vroom’s (1964) model (nor with 
their own statement of the model). A simple 
sum of the Importance X Expectancy scores 
would have been the appropriate predictor. 
Moreover, the authors used an a priori set of 
outcomes. The problem with this approach was 
discussed above. 

A study by Lawler (1968) provides support 
for the causal status of subjects’ instrumen- 
talities. Six rewards were rated by 55 managers 
as to their importance. Subjects’ expectations 
of the degree to which effort and job perform- 
ance led to each of the six rewards was also 
rated. Peer, supervisor, and self-ratings of 
performance were obtained at the same time. 
All of these measures were obtained at the 
beginning and end of a 1-year interval. Mul- 
tiple correlations were generated to assess the 
ability of the six Expectancy X Importance 
scores to predict performance. For the super- 
visors’ ratings, these correlations were .44 
(Time 1), .52 (Time 2), .55 (Time 1 Expect- 
ancy X Importance with the Time 2 Perform- 
ance), and .39 (Time 2 Expectancy X Import- 
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ance with the Time 1 Performance). The first 
three coefficients were significant (p < .05). 
The corresponding correlations for peer ratings 
were .52, .30, .45, and .30 (p < .05 for the first 
and third coefficients), and for the self- 
performance ratings they were .43, .48, .65, and 
.52 (p « .05 for the last three). The causal 
status of the expectancy attitudes is supported 
by the fact that Time 1 Expectancy X Import- 
ance measures are more highly related to 
Time 2 performance measures than Time 2 
Expectancy X Importance measures are re- 
lated to Time 1 performance. Again, however, 
multiple correlation with six Expectancy X 
Importance predictors was used, and the 
rewards were not generated by the subjects 
themselves. 

Another experiment by Hackman and Porter 
(1968) avoided some of the problems with the 
above studies. These authors used instru- 
mentality theory to predict the effectiveness of 
82 telephone company service representatives. 
Instead of developing their own outcomes, 
they had 24 service representatives generate 
relevant outcomes. The 14 most frequently 
mentioned outcomes were included, as well 
as 4 negative outcomes which they added. The 
82 subjects were then asked to evaluate each 
outcome and the probability that working hard 
on the job would lead to each outcome. The 
sum of the products of expectations and 
valence was then correlated with 11 effective- 
ness measures. The correlations ranged from 
25 to 40, and 9 of the 11 were significant 
(p < .05). These results suggest that instru- 
mentalities are important, but not the only 
factor related to employee performance. 

Graen (1969) tested Vroom's job perform- 
ance and job satisfaction models and ex- 
amined some factors affecting perceived in- 
strumentality. His results define important 
boundary conditions for instrumentality the- 
ory. In addition, he proposed some useful 
modifications and extensions of the theory. 

Graen began by clarifying the distinction 
between first- and second-level outcomes. A 
first-level outcome is called a work role. A work 
role is defined as a set of behaviors expected 
by the organization and considered appro- 
priate for the occupant of a position in the 
organization. For example, “effective job 
performer," "team member,” and “group 
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leader" are work roles. Work roles are attained 
and maintained through the performance K 
the expected behaviors in such a way tha 
minimal organizational expectations are met. 
Second-level outcomes are called role outcomes: 
They are the outcomes that accrue to the m- 
dividual as a result of his attainment OF 
maintenance of the work role. Graen focuse 
on two work roles: effective performer and jo 
incumbent. Eight role outcomes (second len 
are included in his study: accomplishmen 7 
achievement feedback, salary, human re! k 
lions, recognition, policies and practices, dii 
sponsibility, and working conditions. As 1n s 
Galbraith and Cummings (1967) study, d 
role outcomes were derived on an a priori p. 

Figure 1 presents the job satisfaction and s i 
performance models of Graen (1969). Ac 
ing to him, job satisfaction is a matter le. 
satisfaction with the job-incumbent work Bic 
It is predicted to be a monotonically increase 
function of the sum of the products of the ole 
strumentality of the job-incumbent WORD Be. 
for attaining role outcomes and the vue d 
the other hand, is predicted to be a pos! 3 
function of the valence of the effective p 
former work role and the perceived expecta” 
that increased effort will lead to attainmen 
this work role. 

The study was an experiment: 
in which women were hired for part- E. 
temporary employment by a ae ; d 
pany. In the first task, subjects had es 
certain correlations from a 155 X 155 nY 
and transcribe them. The second task Wa 
same as the first, except that the correla 
were to be rounded to two places before oup 
scription. There were three treatment P J 
The first, the achievement group, W* phe 
warded for prior effective performance rered 
second group, the incentive group, W35 t pet 
money as an inducement for subsequt" roup' 
formance, The third group was a contro Ange t 
the money they obtained was not CO». fiw 
on performance. Subjects were run d «cd } 
Sessions, and the treatments were int ere IP 
the second session. These treatments bs condi 
tended to permit a test of the boundary" {0 
tions of the instrumentality theory ^, jn 
examine the factors affecting rg a 
strumentality. Upon completion of t^ 


. sation. 
al simulatio 
time 


s the 
tion? 
tran" 


| 
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PROBABILITY 
OF THE ACT d ATTRACTION OF 
Z(A X DE ACT EXPECTANCY (E) WORK ROLE 
Attainment of effective Z(A XI) 
performance 
Superior 
effort 
Nonattainment of effec- D(A XI) 
tive performance 
Person 
Attainment of standard Z(AXI) 
performance 
Standard 
effort 
Nonattainment of stand- Z(AXI) 


ard performance 


where attraction of work role equals Graen's extended model of work role attraction (job satisfaction) : 


WORK ROLE 
ATTRACTION 
(SATISFACTION) 


(A X I) 
Work role; 


E c 
Wes 


Work roles 


Person 


Fic. 1. Graen's extended model of t 


Activity, measures of the following were ob- 
tained: the attraction of work-role outcomes, 
the perceived instrumentality of the two work 
Toles (the particular job and "effective per- 
ormer”) for the attainment of each of the 
eight vole outcomes, and the perceived ex- 
b €ctancy that increased effort would lead to 
More effective performance. 

, ^ Separate job satisfaction measure based on 
[s Hoppock Job Satisfaction Blank (Hoppock, 
1935) was used to measure job satisfaction. 


ATTRACTION OF 
INSTRUMENTALITY ROLE OUTCOMES 
(D (A) 
achievement 


recognition 


salary 
achievement 


recognition 


salary 


he probability of an act (job performance). 


Performance measures used were the number 
of items completed (quantity) and propor- 
tion correct (quality). Both a raw score and a 
gain score (pre- to posttreatment) for both 
quantity and quality measures were used in 
the final analysis. 

Perceived instrumentality varied according 
to the treatment condition. It will be recalled 
that the achievement treatment group was re- 
warded for prior effective performance and 
that the incentive treatment group received 
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money as an inducement for subsequent good 
performance. For the achievement treatment 
group, the perceived instrumentality of the 
job-incumbent work role for achievement and 
recognition was significantly higher than it was 
for the incentive and control groups. For the 
incentive treatment group, the perceived in- 
strumentality of the job-incumbent work role 
for money was significantly higher than it was 
for the other two groups. Graen (1969) also 
looked at the perceived instrumentality of the 
effective performer work role. The achievement 
treatment group rated the instrumentality of 
this work role for achievement and recogni- 
tion significantly higher than did the groups 
in the other two treatment conditions. How- 
ever, the perceived instrumentality of the 
effective performer work role for money was 
not greater for the incentive treatment group. 

The results of this study provide partial 
support for the job satisfaction model and 
Suggest some limitations for it. The specific 
model tested in this study states that satis- 
faction with the work role of job incumbent is 
a monotonically increasing function of the 
sum of the products of the valences of role out- 
comes and the perceived instrumentality of the 
work role for these outcomes. The model was 
tested separately for intrinsic outcomes (ac- 
complishment, achievement feedback, recogni- 
tion, and responsibility), and extrinsic outcomes 
(human relations, policies and practices, salary, 
and working conditions), and for all outcomes 
taken together. It was found that the model 
predicted both job satisfaction (i.e., the Hop- 
pock scale scores) and gains in satisfaction for 
the achievement-feedback treatment group 
when intrinsic outcomes and all outcomes were 
considered. These correlations were .36 (in- 
trinsic outcomes) and .37 (all outcomes) for 
the raw scores, and .32 and .29 for the gain 
scores (p < .05 for all coefficients). Since the 
achievement-feedback group was the only 
treatment group for which a contingency be- 
tween the work role and certain role outcomes 
was clearly stated, Graen concluded that the 
job satisfaction model is limited to situations 
where the contingencies between work roles 
and role outcomes are stated in a concrete 
manner. 

The job performance model (effective per- 
former) failed to predict either the quality or 
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quantity raw performance scores. In other 
words, there was no correlation between the 
raw scores and the product of the expectation 
that effort would lead to the role 9 
effective performer and the valued 
of that role. However, the model was partia ? 
successful in predicting residual gain scar 
These scores were obtained by partialing us 
pretreatment performance scores so that b 
variance in posttreatment performance due 3 
ability, and pretreatment motivation 1S nag 
mized. The effective performer model sig 
nificantly predicted these scores for 
achievement-treatment group w : 
and extrinsic role outcomes were eons? con- 
separately and when all outcomes ien tions 
sidered at the same time. These corre d 

were .29, .22, and .28 for one task, ME 
31, and .39 for another (p < .05 for esta 
coefficients). The model did not predict e 
ual gain scores for the other treatment st its, 

Together with the job satisfaction mu 
the just-mentioned results define | 
conditions for the instrumentality ©" wil 
models. Graen suggested that the theory be- 
be successful only when the contingencies rly 
tween work roles and role outcomes are clear’ 
stated. 

These data also suggest, though les 
that the models will only be successfu 
specific role outcomes that are PT fact. 
work role. This conclusion is based accessfully 
that the job satisfaction model nan 
predicted job satisfaction only for the e only 
outcomes, and these outcomes were MR for 
outcomes specifically linked to the job. 2 yee? 
over, in the case of the relationship be e for 
the job performance model and performance, ag 
the achievement group, the correlations they 
higher for the intrinsic outcomes than x 
were for the extrinsic outcomes. 


dered 


Extended Models 


T 
Both Graen (1969) and Porter and Lawle 
(1968) suggested extensions 
strumentality model to pre 
and job satisfaction. The latter aut 
gested that the effort-performance re tion 
is moderated by ability and role percep we, 
variables. The relevant role puse 
concerned with the types of effort an In? 


dict performan ce 


the 
hen intrinsic ' 


of the basic i» | 


hors su£ | 
lationshiP | 
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- believes are essential to effective job perform- 

. ance. They argued that an accuracy of role 
perceptions score should be multiplied by an 
Effort X Ability score to predict performance. 
They presented some evidence that role per- 
ceptions do indeed moderate the effort- 
performance relationship (Lawler & Porter, 
1967; Porter & Lawler, 1968), but no data 
have been presented that measure all three 
variables and combine them in the suggested 
manner. 

|. Graen proposed a number of modifications 
of the models in addition to the distinction 

i between work roles (first-level outcomes) and 
role outcomes (second-level outcomes). First, 
Graen suggested that the focus of the model 
be on gains in performance or satisfaction 
rather than raw scores. Besides being consistent 
with his results, he argued, this focus makes the 
model a dynamic one and provides the model 
with developmental implications. It is not clear 
why the prediction of changes in performance 
or satisfaction is more important than the pre- 
diction of raw scores, but Graen’s modified 
model, if successful, would make explicit the 
factors that are important for increasing pro- 
duction or satisfaction, 

Graen (1969) suggested additional predictors 
of performance and satisfaction. In addition 
to the Expectancy X Valence term, gains in 
performance are predicted from (a) the per- 
ceived expectations of others and pressure to 
comply with the others and (b) the sum of the 
valence of intrinsic consequences of the per- 
formance gain multiplied by the expectancy 
that the gain will lead to these consequences. 
The latter term reflects the subject’s internally 
felt pressure toward increased performance. 

Gains in satisfaction that are associated with 
Sains in performance are predicted from the 
ire model, except that the term referring to 
; hers? expectations is the amount of pressure 
ig Sonant with increased performance relative 
E pressure and the intrinsic instrumental- 

term refers to the amount of internally felt 
fg, Sure that is consonant with increased per- 
“Ne relative to all internal pressure. 
aq can be seen that both Porter and Lawler’s 
that Taen's work are conceptually similar to 
Clade. Fishbein and Dulany in that they in- 
besiq, ther sources of influence on behavior 
"istrumentalities. Empirical support 


451 


for these extensions, however, still must be 
presented. 

In summary, instrumentalities appear to be 
an important factor in predicting the satis- 
faction and behavior of personnel in organiza- 
tional settings. However, if we compare the 
amount of variance accounted for in the 
organization studies with the amount ac- 
counted for in the attitude and verbal con- 
ditioning areas, it appears that the use of 
instrumentalities has been less successful in 
the former area. There are at least two reasons 
why this might be the case. First, the phe- 
nomena that the industrial psychology studies 
are attempting to predict are more complex 
than those predicted in the various game and 
verbal conditioning settings. Second, the 
situations in the industrial settings are less 
controllable than those in the attitude and 
verbal conditioning areas. 


Discussion AND EVALUATION 


The evidence reviewed in this study pro- 
vides fairly good support for the basic in- 
strumentality hypothesis. The hypothesis that 
behavior is related to the perception that it 
will lead to certain outcomes and the valence 
of those outcomes received some support in 
three areas of psychology: verbal conditioning, 
attitudes, and industrial psychology. This 
section discusses the strengths and weaknesses 
of the theories that have been reviewed in this 
study. For the most part, these theories con- 
tain the instrumentality hypothesis and have 
additional components referring to the norma- 
tive influences on behavior. We begin by point- 
ing out some of the favorable aspects of these 
theoretical approaches. 


1. Both the instrumentality hypothesis and 
the more extended theories that out 
provide a systematic framework for the M 
tion of new variables. One might isolate a n 
tional outcomes that am imponim o ra 
havior in particular situations. Simi “ a 
might have different normative ae : 
corresponding to the perceived "E "p 
different groups in the subject eT n 
example, the peer and manage SE 
in the worker's role ee «^ es 
formulations of Dulany, Tu ded Mare 
entirely new components can 
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is reason to believe that they affect behavior 
and behavioral intention. 

2. The theories reviewed here represent a 
major step toward the treatment of the com- 
plexity of social behavior. For many years, 
psychologists have related variables in a pair- 
wise fashion (e.g., need achievement and pro- 
duction), with the net result that we have 
many specific relations and very little integra- 
tion. The theories reviewed above provide a 
structure allowing the psychologist to examine 
the effects of a number of variables that may 
be simultaneously affecting behavior. The 
instrumentality component incorporates a 
whole set of outcomes and probabilities that 
may influence a given behavior. Dulany's, 
Fishbein's, and Graen's theories include the 
expectations of a set of other people and one's 
willingness to comply with these expectations 
along with instrumentalities. 

3. The theories described in this study sug- 
gest important sources of behavioral change. 
The instrumentality component suggests two 
ways in which behavior may be changed. First, 
the subjects perception of the instrumental 
relations between behavior and its outcomes 
may be changed. For example, employees may 
value the rewards provided by the organiza- 
tion but not know which behaviors lead to 
them. Second, behavior may be influenced by 
changing the value of outcomes or organiza- 
tional rewards. These theories also suggest 
that changes in the behavior of the subject 
result from the expectations of others. Thus, 
one might also bring about changes in the 
target subject by changing the expectations 
of those around him. 

4. Finally, in the first section we discussed 
the necessity of using a logic of theoretical 
networks in order to support the contention 
that unobservable constructs are causes of be- 
havior. This logic requires richly articulated 
theories including a network of constructs and 
operations. The instrumentality theories that 
include a normative component and, to a lesser 
extent, the instrumentality idea by itself are 
relatively good examples of how this approach 
to theory building could be used. These 
theories, as pointed out above, (a) can use a 
large number of variables, (5) are open to new 
additions, and (c) have received relatively good 
support when compared to other interpreta- 
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tions (e.g., the comparison with experimental 
manipulations by Dulany). 


There are also a number of problems as- 
sociated with this theoretical approach and 
with the statistical techniques typically em- 
ployed to test it. 


1. The theories demand specific rules q 
combination that exclude the possibility t 
examining certain interaction effects. Two eT 
comes, when they are both perceived a 
sible consequences of an act, may cont Bs 
more to the intentions of an individual E to 
when they appear separately with rep rom 
other behaviors. That is, outcome Boro ity 
may be important. To date, this poss! 
does not appear to have been tested. hs 

2. Many of the theories that we BAe 
above are tested by multiple regress! sion 
their most common use, multiple Ew. 
equations limit the investigator to we M 
weighted linear function as the rule o This 
bination of components for all subjects- gad: 
constraint is not required by the theory; 4 
in fact, it was suggested that the relat! ^ 
of each factor may differ over individuals al i 
situations. A weighting system that EI 
chologically meaningful in the sense tually, 
reflects the way in which individuals ac 
combine these factors is needed. 

3. Even when the theories are able to pr 


wed 
In 


edict 


: ° . 1 ith ex - 
behavioral intentions or behavior W1 


ituation® 
cellent accuracy, there appear to be situat 


: A ; - inctrument^ 
where behavioral intentions or instrum 


g 
alities are not highly related to actual t5 


havior. Fishbein (1967b) has pointed out (P. 
the degree of control that the individua il 
over his behavior is important. In some Cher 
one's behavior may be controlled by cae eae 
person as, for example, when an employ most 
tends to see his boss. These theories pe 
either include variables that mediate t3 r 


> * 5 A a tionships 
havioral intention-behavior relation HP al. 


a 
be qm indivi 
limit themselves to behaviors the 

does control. con 


" | the 
á ioned earlier tbat t 
4. It was mentioned earlie compon M 


structs in the instrumentality ceive 
change over time. That is, Promes Oly 
strumentalities and the value of ou ‘ons ? 


tions 4 
change; so do the perceived expe dictt 
others. When this change cannot Lh a! 
or controlled, behavioral intention 


ive weight | 
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components of the theory may not be related 
to actual behavior. 

5. The preceding three criticisms point to 
the need for a theory of the situation. Such a 
theory should describe those aspects of the 
situation that affect the relative weights of 
the factors in the instrumentality equation 
and that mediate the relationship between be- 
havioral intention and behavior. The authors 
and their colleagues are presently attempting 
to develop a structural role theory (Oeser & 
O'Brien, 1967; Oeser & Harary, 1962, 1964) 
approach to the factors affecting the weights 
of the components in the instrumentality 
equation. a 

6. Many of the theories discussed above use 
a standard set of outcomes for all subjects. 
It is likely that different outcomes are relevant 
for different subjects. Moreover, Rosenberg’s 
(1956) data suggest that attitude is linked to 
only a small number of outcomes and that 
additional outcomes degrade prediction. This 
suggests that subjects should be asked to 
generate their own outcomes. 


7. Finally, we are struck by the possibility 


that the instrumentality component of these 


theories may be culturally specific. Thus, in- 
dividuals who think solely in terms of the 
outcomes accruing às the result of their be- 
havior may not be found in cultures other than 
our own. Similarly, there may be sex differ- 
ences in the applicability of the instrumentality 
hypothesis. 


Jn summary, instrumentality theories are 
being used in à wide range of areas in psy- 
chology. Theories using these concepts have 
generally been supported. Additional work 
with these theories should integrate them with 
a theory of the situation, develop psychologic- 
ally meaningful weighting systems for the 
factors included, and examine the interaction 
of factors as they affect behavioral intention 
“nd behavior. 
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